A vote for science


In support for science and environmental issues, Barack Obama and the Democrats have a clear 
advantage over Mitt Romney and the Republican Party. 


When Americans vote for their next president in early November,
they will select not just one man to occupy the White House, but also
the thousands of party members who will sweep into Washington DC.
Those appointees from the president’s political party will help to set
priorities in science funding, negotiate international treaties and
decide whether to create new regulations — or ignore existing ones.

This is important to consider when evaluating the two candidates 
because the challenger to President Barack Obama, Republican 
Mitt Romney, has not offered specific plans to manage the roughly 
US$65 billion of funding that goes to non-military research and 
development each year. The past positions of the candidates and the 
records of their own political parties make it clear that Obama and 
the Democrats offer important advantages for science over Romney 
and the Republicans. 

Budget proposals show the clearest differences. During his term, 
Obama has consistently supported strategic spending increases for 
science, with his most recent budget proposal calling for a 5% boost 
that targets physical sciences. Although some have been unhappy with 
the details — in particular the decision to forgo a funding increase for 
the US National Institutes of Health — Obama’s science portfolio has
fared remarkably well during the financial crisis. 

Romney has voiced his support for science and said that he 
would make federally funded research a budget priority, but he 
has offered no specific details. Given Romney's pledge to reduce 
overall federal spending and to lower taxes, some scientists worry 
that key research agencies would experience sharp budget cuts. 
Last week, 68 Nobel science laureates said as much in an open 
letter of support for Obama. A Romney budget, the letter said, 
would “devastate a long tradition of support for public research and 
investment in science at a time when this country’s future depends, 
as never before, on innovation”. 


DOLLAR DRAIN 

It is notable that Romney chose Paul Ryan, a congressman from 
Wisconsin, as his vice-presidential nominee. Ryan this year offered a 
long-term budget plan that would slash spending for civilian research 
and development. 

The two parties diverge in past support for key areas of research. 
In 2009, Obama reversed an executive order by former president 
George W. Bush that had restricted government-supported research 
on human embryonic stem cells. Romney’s position on the topic 
remains unclear, although his shift over the past few years towards 
more conservative policies on abortion has worried supporters of 
human embryonic stem-cell research. To back such work, he would 
have to cross the hard line established by his own party. In August, 
Republicans issued a political platform that said: “We oppose federal 
funding of embryonic stem cell research.” And Ryan voted several
times as a congressman to curtail federal funding in this area.

Similar strong differences can be seen in positions on climate- 
change research and policies. When he took office, Obama reversed 
a trend of declining support for the US climate-research portfolio, 
and he wants a further 5.6% increase in 2013. He authorized the EPA 
to set limits on greenhouse-gas emissions from cars. And Obama's 
team has engaged in the United Nations’ climate negotiations to the 
extent possible given congressional constraints. The administration 
has taken a leadership role in the Montreal Protocol to protect the
ozone layer, as well as in the Climate and Clean Air Coalition, an
international effort launched in February to reduce pollutants such as
methane, black carbon and tropospheric ozone, which warm Earth’s
surface.

Romney has diverged with many in his party by acknowledging 
that humans have helped to warm the planet, but he has exag- 
gerated the disagreements among climate scientists. According 
to Romney: “There remains a lack of scientific consensus on the 
issue — on the extent of the warming, the extent of the human 
contribution, and the severity of the risk — and I believe we must 
support continued debate and investigation within the scientific 
community.” He has campaigned against international climate 
agreements and last month told the television programme Meet 
the Press that “I’m not in this race to slow the rise of the oceans or 
to heal the planet”. 

His running mate has gone further to challenge climate scientists. 
After the release of e-mails in 2009 from the University of East Anglia 
in Norwich, UK, Ryan wrote: “These e-mails from leading climatolo- 
gists make clear efforts to use statistical tricks to distort their findings 
and intentionally mislead the public on the issue of climate change.” 
He also joined nearly all Republicans in the House of Representatives 
in 2011 to try to prevent any US funding of the Intergovernmental 
Panel on Climate Change. Romney and Ryan have vowed to roll back 
some of Obama’s climate policies and would strip the EPA of its power
to regulate greenhouse gases. 

They have also promised to cut back sharply on new regulations 
and to require Congress to approve any major ones, which would 
effectively block agencies from setting important new rules. That 
requirement is so limiting that it will probably not come to pass, but it 
has helped to foster extreme views about government that could keep 
science agencies from carrying out their missions should Republicans 
sweep the November elections. 

Over the past four years, Obama has demonstrated strong support 
for science and innovation, as well as policies that flow from research. 
Romney has not offered many details of his plans for science, but those 
he has released — and the recent record of his party — do not bode 
well for US science or its international partners.

Banish cronyism


A Texas cancer-research fund needs big fixes
to restore badly eroded trust. 


What a window into reality one misdirected e-mail can
provide. Jimmy Mansour, head of the governing board
at the Cancer Prevention and Research Institute of Texas 
(CPRIT) in Austin, mistakenly hit ‘reply all’ on a missive he sent last 
week, celebrating the resignations of many of the institute's scientific 
peer reviewers. The e-mail, first publicized by the Houston Chronicle, 
makes clear that Mansour, an accountant, lawyer and telecommunications
entrepreneur, views scientists and scientific review with breathtaking
disdain. “Better to get them all out of the way now,” Mansour
wrote following yet another of the letters of resignation tendered to 
the CPRIT in recent weeks by at least 29 of the institute's first-rate 
scientific reviewers. The letters were written in solidarity with the very 
public departure from the CPRIT of Nobel laureate Alfred Gilman, 
who until two weeks ago was the institute's chief scientific officer. 
Gilman had given up on a review process that he labelled as corrupted 
by cronyism and unduly influenced by “really evil” political appointees 
on the CPRIT’s governing board, motivated by regional Texas rival- 
ries and personal interest. For instance, several grants to scientists at 
the University of Texas Southwestern Medical Center at Dallas, easily 
the best biomedical research institution in the state, were shelved by 
CPRIT leaders early this year despite strong endorsement from scientific 
reviewers. At the same time, a business-incubator grant was speedily 
awarded without scientific review: US$18 million went to the University 
of Texas MD Anderson Cancer Center and $2 million to Rice Univer- 
sity, both in Houston. After Gilman and the chairs of all eight of the 
CPRIT’s scientific-review panels complained, the incubator grant was 
pulled back, and the institute said that a scientific-review panel and 
commercial reviewers would all need to agree that it passed muster. It 
hired a compliance officer and, in August, funded the shelved grants.


An outside observer could have been forgiven for surmising that 
with these last three actions, the institute was mending its ways and 
beginning a new chapter. But then, as we report on page 459, came 
Mansour’s e-mail, written to CPRIT executive director William 
Gimson on 14 October, two days after Gilman’s departure. 

The exodus of scientific reviewers “gives us the prime opportunity to 
announce a new regime”, wrote Mansour. “There will be a number of
Texas Institutions who will be ecstatic.” If the CPRIT concentrates on 
positive messaging at a conference this week, he added, “Gilman and 
the regime of the old guard (of research) will get the message” that the
institute is moving forward. Forward towards what? His words give little
confidence that the goal is a process that will judge science purely on
its merits.

Mansour would make an almost comic villain were the stakes not so
high. He is responsible for the $3 billion that more than
60% of voting Texans approved in 2007 to fight cancer over ten years; 
his term at the head of the governing board does not expire until 2015. 
Yet it is clear that, if the institute hopes to recruit independent peer 
reviewers of the calibre that have just departed in droves, its leadership 
must change at the very highest levels. Mansour’s removal is essential; 
at least one ethicist has suggested that Gimson should go as well. Such 
housecleaning is also the only way to begin rebuilding the trust of the 
Texas public, which has every right to expect that the $2 billion as 
yet unspent by the CPRIT be awarded through unimpeachable peer 
review. After all, when they voted five years ago, Texans authorized the 
state to borrow $3 billion “for research in Texas to find the causes of 
and cures for cancer”. They did not vote for awards to bypass scientific 
scrutiny, for cronyism or for the equal distribution of grants between 
Texas institutions, regardless of the quality of proposed projects. 
Happily, the citizens of the state still have some leverage. A vote of 
the state legislature is required every year to issue the annual $300 mil- 
lion in funding that supports the CPRIT. Texans should insist that their 
state politicians make future funding contingent on a serious clean-up, 
and on the re-implementation of top-calibre peer review for every 
single project that the CPRIT funds.


Shock and law 


The Italian system’s contempt for its scientists 
is made plain by the guilty verdict in L’Aquila.


“I’m not crazy. I know they can’t predict earthquakes,” the Italian
public prosecutor Fabio Picuti told Nature last year. He was
speaking as the manslaughter trial began in the ruined town of
L’Aquila of six scientists and one government official for their alleged
role in the deaths of 309 people in the quake of April 2009 (see Nature
477, 264–269; 2011). On Monday evening, the seven were found guilty
and sentenced to six years in prison (see Nature http://doi.org/jkp;
2012). The verdict is perverse and the sentence ludicrous. Already
some scientists have responded with warnings about the chilling effect
on their ability to serve in public risk assessments.

Even Picuti was surprised. He had requested a prison term of four
years. “We’ll have to read the judge’s motivations to understand why,”
he said. Under Italian law, judge Marco Billi has up to three months
to reveal his reasoning.

Despite the way the verdict has been portrayed in the media as an
attack on science, it is important to note that the seven were not on trial
for failing to predict the earthquake. As members of an official risk
commission, they had all participated in a meeting held in L’Aquila
on 31 March 2009, during which they were asked to assess the risk of
a major earthquake in view of the many tremors that had hit the city
in the previous months, and responded by saying that the earthquake 
risk was clearly raised but that it was not possible to offer a detailed 
prediction. The meeting was unusually quick, and was followed by a
press conference at which the Civil Protection Department and local 
authorities reassured the population, stating that minor shocks did 
not increase the risk of a major one. 

According to the prosecutor, such reassurances led 29 victims who 
would otherwise have left L’Aquila in the following days to change
their minds and decide to stay; they died when their homes collapsed. 
The prosecutor thus reasoned that the “inadequate” risk assessment of 
the expert panel led to scientifically incorrect messages being given to 
the public, which contributed to a higher death count. 

The seven — Bernardo De Bernardinis, Enzo Boschi, Giulio Sel- 
vaggi, Franco Barberi, Claudio Eva, Mauro Dolce and Gian Michele 
Calvi — are appealing against the verdict. They will remain free until 
the appeals process is finished, which could take years. 

That provides an opportunity. There will be time enough to ponder 
the wider implications of the verdict, but for now all efforts should 
be channelled into protest, both at the severity of the sentence and at 
scientists being criminalized for the way their opinions were commu- 
nicated. Science has little political clout in Italy and the trial proceeded 
in an absence of informed public debate that 
would have been unthinkable in most European 
countries or in the United States. Billi should 
promptly explain his decision, and the scientific 
community should promptly challenge it.

WORLD VIEW A personal take on events

Throw off the cloak of invisibility

Improving Wikipedia entries for notable women scientists should be only the
start for a higher profile for women in science, says Athene Donald.

NATURE.COM: Discuss this article online at go.nature.com/qofi9m

Take a look at Wikipedia this week and you will find that something
has changed. The online reference tool now includes an entry for
Eleanor Maguire, the cognitive neuroscientist at University College
London who showed that the brains of London’s taxi drivers change as
the drivers develop their knowledge of the city’s streets. And Louise
Johnson, a molecular biophysicist at the University of Oxford, UK, from
1990 until her death last month, now has a longer entry more worthy of
her achievements. The changes are welcome, but overdue.

Around this summer’s Olympic and Paralympic Games I heard
much discussion, some rather surprised, about the excitement and
excellence of women’s sports. Suddenly, people realized that women
could excel at everything from football to cycling, from rowing to
boxing — and be thrilling to watch. Despite years of work to address
sexism, women in science tend to be the equivalent poor relation.
Although we are known to lurk in there somewhere, on occasion women
are still seen as add-ons who need to be patted on their pretty little
heads, not always taken seriously and disadvantaged in pay and in other
resources.

Indeed, a study published last month in the Proceedings of the
National Academy of Sciences showed that for identical CVs submitted
under a male or female name, the women were rated as significantly less
competent and hireable than men — irrespective of the sex of the
evaluator — and there were notable differences in the salary recommended
(C. A. Moss-Racusin et al. Proc. Natl Acad. Sci. USA
http://dx.doi.org/10.1073/pnas.1211286109; 2012). Unconscious bias is
still prevalent, even if overt discrimination is rare.

Following on from a successful ‘Edit-a-thon’ earlier this year in
collaboration with the Smithsonian Institution in the United States,
last Friday Wikipedia held a similar event in conjunction with the
Royal Society in London, to improve poor entries for some notable
women scientists.

The Royal Society’s library made its extensive resources available,
to help to ensure that the new and improved entries are not only
comprehensive but also accurate, something not always guaranteed
on the Internet. This has to be a step in the right direction. But it
is only one small step; there is a great deal of work to be done to
ensure that female scientists and their important contributions are
recognized appropriately, both on Wikipedia and more generally.

The latest Edit-a-thon was timed to tie in with Ada Lovelace Day.
Lovelace is often remembered just as the daughter of the poet Byron,
rather than as a mathematician who made important contributions
to the field that would become computer science, including what could
be considered the first computer program, to be used in conjunction
with Charles Babbage’s ‘difference engine’. She does, at least, have a
comprehensive Wikipedia entry.

By raising the profile of some women scientists and by training edi- 
tors to be sensitive to the fact that such women are under-represented 
(in both number and length of entry) on Wikipedia, we can hope 
for a steady improvement in how women scientists are portrayed. 
Young aspiring scientists are very likely to use Wikipedia to gather 
information about past and present scientists. To help them to see 
what life might be like if they did pursue a career in science, we need 
realistic entries that provide valuable insight about living as well as 
dead scientists. 

I sincerely hope that this edit-a-thon will 
encourage more people to overhaul the online 
portfolio of women scientists’ entries. The 
number of female Fellows of the Royal Society 
is small, although growing, so it should be a 
reasonable aspiration for us in the United Kingdom
to give each one a decent write-up. I’ll put
my hand up and admit that as one such female 
fellow, I do have a rudimentary entry, although 
it is not particularly informative or inspiring in 
its current state. 

But we fellows are just one tiny band of largely
academic scientists. We need entries for more 
women scientists, and for technologists too, 
with handsome photographs and some context 
for their work and lives. I have a long-standing 
interest in this issue. I sit on the Royal Society’s Equality and Diversity 
Network and I chair the Athena Forum, a national group dedicated to
improving the progression of women in science, technology and
medicine. And I blog.

We must continue to raise the profile of women in science, and 
remind people — as we did with the Olympic sports — that women’s 
contributions are every bit as important as men’s. 

We need to do this for a variety of reasons. First, to inspire young 
women currently considering their options at school; second, to pro- 
vide role models for women already studying science but uncertain 
whether they should pursue it further; and third, for simple natural 
justice. Women have, throughout the centuries, made contributions 
to the scientific endeavour. Historically, their opportunities were dire 
and their numbers small. Today the situation is very different, but 
this is still not a problem solved.


Athene Donald is a professor of physics at the University of 
Cambridge, UK. 
e-mail: amd3@cam.ac.uk


RESEARCH HIGHLIGHTS Selections from the scientific literature


BIOCHEMISTRY 


EPO made 
from scratch 


The hormone that triggers 
red blood cell production, 
erythropoietin (EPO), is often 
given to patients undergoing 
treatment for kidney failure 
or cancer. A team at the 
Sloan-Kettering Institute for 
Cancer Research in New York
City has devised a method to
synthesize a pure form of this 
complex protein from scratch. 
Samuel Danishefsky and his 
colleagues built the protein, 
which until now could be made 
only in cell culture, by piecing 
together four glycopeptides 
that they had assembled in 
the lab. The researchers then 
folded the amino-acid chain 
into the final protein and used 
mass spectroscopy to verify 
the structure. Umbilical- 
cord blood-progenitor cells 
that were cultured with the 
synthetic molecule formed red 
blood cells. 
Angew. Chem. Int. Edn. 
http://dx.doi.org/10.1002/anie.201206090 (2012)


Protecting prey 
with their odours 


Exposing wild black rats to 
the smell of bird prey seems 
to put them off prey that 
is introduced later on — a 
strategy that could prove 
useful in species conservation. 
To simulate bird nesting odours, Catherine Price and Peter Banks at
the University of Sydney, Australia, placed feathers and faeces from
quail (Coturnix coturnix japonica) in the habitat of the wild black rat
(Rattus rattus; pictured). Seven days later, the researchers introduced
artificial nests containing quail eggs. In areas where rats had been
pre-exposed to the nesting odours, quail-egg survival was 62% greater
than in areas where rats and eggs were introduced simultaneously.

The authors suggest that, during the pre-exposure period, the rats
learned to ignore the odour cues because they were not associated with
an egg reward.
Proc. Natl Acad. Sci. USA http://dx.doi.org/10.1073/pnas.1210981109 (2012)


How to move a 4-tonne statue


The Polynesian settlers of Easter Island may have transported their
gigantic statues by slowly rocking them from side to side to make them
‘walk’.

Nearly 1,000 statues litter the island, with the largest weighing about
74 tonnes and standing more than 10 metres tall. Some archaeologists
propose that the statues, or moai, were transported from the quarry in a
horizontal position on top of logs. However, Carl Lipo at California
State University, Long Beach, and his team say that evidence points
instead to an upright mode of transportation. Broken moai that were
found along roads sloping upwards mostly lay on their backs, whereas
those discovered on downwards sloping roads tended to be lying face
down.

The researchers built a 3-metre-high concrete scale model, which they
say has the same physical properties as a moai. Using three hemp ropes,
a team of 18 people was able to move the statue 100 metres in
40 minutes (pictured). However, others are sceptical of the findings,
saying that the shape of the model is inaccurate.
J. Archaeol. Sci. http://dx.doi.org/10.1016/j.jas.2012.09.029 (2012)
For a longer story on this research, see go.nature.com/1qoups


PLANETARY SCIENCE


Moon spun off from Earth


A catastrophic collision between Earth and another body probably
created the Moon. Computer models predict that the Moon was derived
from the impacting body, and yet Earth and the Moon are chemically
almost identical. Two new models show how this scenario could arise if
Earth was spinning faster at the point of impact than it does today.

One model, by Robin Canup of the Southwest Research Institute in
Boulder, Colorado, suggests that if the impactor was larger than
previous models have allowed — perhaps even Earth-sized — a collision
could create a planet and a disk of Moon-forming material, both with
the same composition and made from the impactor and its target.

Meanwhile, Matija Ćuk and Sarah Stewart at Harvard University in
Cambridge, Massachusetts, show that if Earth used to spin faster than
it does today, even a small, fast impactor could cause Earth to shake
off a disk of material that could subsequently form the Moon.
Science http://dx.doi.org/10.1126/science.1226073;
http://dx.doi.org/10.1126/science.1225542 (2012)


BIOMATERIALS 


Super-reflective 
fish skin 


Three species of silvery fish 
seem to have found a way 
around a law of physics that 
governs the reflection of 
light. 

The skin of the Atlantic 
herring (Clupea harengus), 
European sardine (Sardina 
pilchardus; pictured) and 
sprat (Sprattus sprattus) is
made up of alternating layers
of cytoplasm and highly
reflective crystals of guanine
— a molecule that is also
found in DNA and RNA. 
Nicholas Roberts and his 
colleagues at the University 
of Bristol, UK, report that 
the fishes’ skin can reflect 
light without polarizing 
it, even when the light hits 
the reflectors at angles that 
would normally result in fully 
polarized reflections. The 
skin contains a mixture of two 
types of guanine crystal with 
different optical properties — 
when the two are present in 
a specific ratio, this mixture 
prevents polarization and 
maintains high reflectivity. 

These reflectors help 
the fish to camouflage 
themselves by matching the 
light environment of the 
open ocean, say the authors. 
Moreover, the principles at 
work in these fish could have 
applications in optical devices 
such as light-emitting diodes. 
Nature Photon. http://dx.doi.org/10.1038/nphoton.2012.260 (2012)

For a longer story on this research, 
see go.nature.com/ndcqjt 


CANCER


Targeting a
vicious cycle


Cancer cells proliferate
by sending the cell cycle
into overdrive, but early
attempts to target the cell
cycle with drugs were marred
by problems with toxicity.
Now two groups show that 
shutting down cell-cycle 
proteins called D-type cyclins 
seems to stop tumour growth 
without affecting normal 
tissue. 

Piotr Sicinski and his 
colleagues at the Dana-Farber 
Cancer Institute in Boston, 
Massachusetts, engineered 
mice in which the production 
of cyclin D1 can be switched 
on and off. Loss of the protein 
had no apparent effect on 
healthy adult mice, but halted 
the growth of mammary 
tumours in mice that were 
also genetically predisposed to 
developing breast cancer. 

Sicinski’s team, and 
Iannis Aifantis at the New 
York University School of 
Medicine and his colleagues, 
showed independently 
that loss of cyclin D3 either 
triggered the death of tumour 
cells, or prevented their 
growth, in mouse models 
and human cell lines of an 
aggressive form of leukaemia. 
Inhibiting downstream 
proteins called cyclin-D 
associated kinases also killed 
tumour cells. 

Cancer Cell 22, 438-451; 
452-465 (2012) 


ARCHAEOLOGY 


A more accurate 
carbon clock 


Determining the age of fossils and other ancient objects could become
more accurate, thanks to measurements of radioactive carbon-14 from a
lake in Japan. Radiocarbon dating is based on the steady decay rate of
carbon-14 in samples, and archaeologists calibrate this carbon clock by
comparing the known ages of tree rings with their carbon ages. But the
tree-ring carbon record goes back only about 14,000 years, and less
reliable marine records have been used to fill the gap. Christopher
Ramsey at the University of Oxford, UK, and his colleagues extracted
roughly 70-metre-long core samples from the bed of Lake Suigetsu. By
counting the number of distinct sediment layers in the core — two
layers have formed every summer and winter over the past 52,000 years —
the team was able to compare the ages of the layers with their
carbon-dated ages.

The more accurate carbon-14 record could help archaeologists to
fine-tune the dates of key events, such as the coexistence of humans
and Neanderthals.
Science 338, 370–374 (2012)
For a longer story on this research, see go.nature.com/pbv6ey


GEOSCIENCE


Sudden rupture in deadly earthquake


HIGHLY READ on www.agu.org, 15 Oct–21 Oct

A brutal secondary rupture may have been responsible for much of the
devastation caused by the Wenchuan earthquake, which killed more than
69,000 people in Sichuan province, China, on 12 May 2008.

A team led by Guohong Zhang of the China Earthquake Administration in
Beijing used ground-acceleration data from a national network of
earthquake-monitoring stations to reconstruct the spread of the
7.9-magnitude quake. Records from 26 local stations indicate that the
main slip area — a high-friction patch of fault located 25 to
50 kilometres from the epicentre — resisted the seismic stress building
up around it for about 30 seconds. The slip area then failed
catastrophically in a sudden 12.5-metre jolt.

Researchers have long hypothesized that an initial quake can trigger
secondary ruptures. The study shows that this is a realistic scenario,
the team concludes.
Geophys. Res. Lett. http://dx.doi.org/10.1029/2012GL052516 (2012)


White LEDs 
without glare 


A new luminescent material 
reduces the glare of white 
light-emitting diodes (LEDs) 
and could make them more 


practical for home lighting. 

A team led by Hisayoshi 
Daicho of Koito Manufacturing 
in Shizuoka, Japan, designed 
the material, which consists of 
chlorosilicates that have been 
implanted with europium 
ions. The phosphor absorbs 
violet light and re-emits it in a
yellow shade. The researchers 
combined this phosphor with 
a blue one and then added a 
violet LED to create a system 
that emits white light with a 
uniform hue (pictured) and 
no glare. 

Nature Commun. http://dx.doi.org/10.1038/ncomms2138 (2012)


SEVEN DAYS The news in brief


POLICY 


Biodiversity cash 
Wealthy nations have agreed 
to double their support to 
developing countries for 
protecting biodiversity, to 
reach about US$10 billion 
annually by 2015. The 
pledge was made at the 
United Nations Convention 
on Biological Diversity in 
Hyderabad, India, which 
ended on 20 October. The 
host country said that it would 
contribute $50 million over 
two years for conservation. 


Egg freezing is safe 
Egg freezing is no longer an
‘experimental’ procedure,
says the American Society
for Reproductive Medicine
in Birmingham, Alabama,
which issued new guidelines
for the controversial practice
on 22 October. That change in
policy is expected to accelerate
the growth of clinics that offer
to freeze the eggs of women
who face fertility-damaging
treatment or who wish to delay
having a baby. See
go.nature.com/ugsflc for more.


Biofuels rethink 


The European Commission
launched an overhaul of its
biofuels policies on 17 October,
after criticism that the rules
have contributed to rising
food prices and concerns that
biofuels may produce greater
greenhouse-gas emissions than
fossil fuels (see Nature
http://doi.org/bmssn7; 2011). The
proposals retain a target that
10% of transport fuels should
come from renewable sources
by 2020 — but set a 5% cap
on food-based biofuels.
See go.nature.com/xhgytb
for more.


Forest-code veto
President Dilma Rousseff once more exercised her veto before finally
signing off the long-delayed revision of Brazil’s forest code into law
on 17 October. She rejected nine provisions that, among other things,
would have removed forest protection along rivers and slopes, and would
have allowed lawbreakers to receive a blanket amnesty. The country’s
National Congress initially passed a bill to roll back forest
protection in April, and Rousseff then used her veto powers to alter
that version and send it back to lawmakers.


Manslaughter verdict rocks seismology
After a 13-month trial, six scientists and former government official
Bernardo De Bernardinis (pictured) have been found guilty of
manslaughter because of the way that they assessed and communicated
risk before the earthquake that hit the Italian city of L’Aquila in
April 2009, killing 309 people. The seven men were each sentenced to
six years in prison, and must pay compensation to the victims’
families, although the defendants will appeal the verdict. At the
22 October hearing, a prosecutor said that the men’s reassurances led
to the wrong messages being given to the public, which added to the
death count. See page 446 and go.nature.com/5orvaa for more.


Badger cull stalled
The British government on 23 October delayed a controversial cull of
badgers (Meles meles) that has provoked years of heated debate among
researchers, farmers and politicians. The cull was scheduled to start
imminently as part of efforts to control bovine tuberculosis, which
badgers can transmit to cattle (see Nature 490, 317–318; 2012) — but it
will now take place no sooner than next summer. See
go.nature.com/aizbax for more.


India GM concerns
Prospects for growing genetically modified (GM) food crops in India
receded further when a panel of scientists called for a moratorium on
all field trials in a report issued on 17 October. The five-member
panel had been appointed by India’s Supreme Court in May in response to
a petition from anti-GM protesters. Its conclusions echo an August
parliamentary report, but go further, calling for a re-examination of
biosafety data on GM crops already approved for field trials. See
go.nature.com/kovfre for more.


Stem-cell bid 

Tom Okarma and Michael West, former chief executives of biotechnology firm Geron, sent the company’s shareholders a letter bidding for its stem-cell assets on 18 October. Geron, based in Menlo Park, California, spent more than a decade developing a spinal-cord-injury treatment derived from human embryonic stem cells and performed early clinical testing in 2010. But John Scarlett, the company’s current chief executive, shut down the programme last November, saying that Geron’s cancer therapeutics are a better investment. See go.nature.com/tihbe6 for more.


Diet-pill concern 
Europe's drug regulators have recommended against approving a diet pill recently cleared for sale in the United States. The European Medicines Agency’s Committee for Medicinal Products for Human Use said that it was concerned about cardiovascular and nervous-system side effects for the obesity drug made by Vivus of Mountain View, California. Vivus says that it will appeal the 18 October opinion.


Battery bankruptcy 
A123 Systems, a leading US manufacturer of lithium-ion batteries for electric vehicles, filed for bankruptcy on 16 October. The firm, based in Waltham, Massachusetts, had been backed by a US$249-million grant from the US Department of Energy. Founded in 2001, it was a spin-off from work at the Massachusetts Institute of Technology in Cambridge. See go.nature.com/mkqsbc for more.


PEOPLE

EU health fracas
Europe's health commissioner, John Dalli, has left his role amid fallout from a corruption investigation. The European Commission said on 16 October that Dalli resigned after the European Union (EU) anti-fraud office, OLAF, looked into legislation on tobacco regulation. According to the commission, a tobacco company complained that a third party had suggested using contacts with Dalli to influence the proposed legislation. Dalli denies the allegations. See go.nature.com/zqplrx for more.


RESEARCH

Vostok lifeless?
No native microbes were found by an early analysis of the ice on the drill used by a Russian team to penetrate Lake Vostok, a body of water buried deep under Antarctica’s ice, in February. The top layer of the lake seems to be lifeless, according to “very preliminary results” presented by Sergey Bulat of the Petersburg Nuclear Physics Institute in Gatchina, Russia, at the 12th European Workshop on Astrobiology in Stockholm on 16 October. See go.nature.com/cpe12m for more.


Iconic ibis shot
A bird that had been hand-reared by researchers as part of a project to save a rare species of ibis was killed by poachers in Italy on 13 October. Goja (pictured), a northern bald ibis (Geronticus eremita), had been trained to migrate from a breeding area in Germany to wintering grounds in Italy and was the first bird in the project to fly back unaided to Germany in summer 2011. The bird has been extinct in the wild in Europe for nearly 400 years. See go.nature.com/4buekb for more.


UK animal research
A poll released on 19 October by Ipsos Mori suggests that 63% of people in the United Kingdom support animal experimentation where there is no alternative, a drop from 73% in 2010. Only 43% think that UK rules on animal experimentation are well enforced, down from 56% two years ago. In response, medical charities, drug firms and 15 universities issued a declaration affirming that their research involves animals only where other avenues are not possible, and said that they would develop transparency principles to win over the public to the need for animal research. See go.nature.com/Ispaam for more.


Chimp haven
The US National Institutes of Health (NIH) said on 17 October that it will send 20 chimpanzees to permanent retirement in a federally funded sanctuary by August 2013, double the number it announced last month. The animals are among 110 NIH-owned chimpanzees that the agency is removing from the New Iberia Research Center in Lafayette, Louisiana. Officials at the 80-hectare Chimp Haven sanctuary in Keithville, Louisiana, say that they would like to accommodate all the animals, but need an extra US$2.55 million to build the necessary structures.


EVENTS

Phones and tumours
Italy’s highest civil court has stated that mobile phones can cause brain tumours, to the dismay of medical experts who say no study has proven a clear causal link between health risks and mobile-phone use. In a 12 October decision made public last week, the court ruled in favour of a commerce manager who claimed his tumour was a consequence of the heavy phone usage demanded by his job. See go.nature.com/bg6zly for more.


TREND WATCH
New cases of tuberculosis (TB) fell by 2.2% between 2010 and 2011, the World Health Organization said in its annual report on the disease. But some 3.7% of new TB cases are now multi-drug resistant. In some countries in eastern Europe and central Asia, multi-drug resistance occurs in more than 20% of cases (see chart). Because of incorrect diagnoses and a shortage of data in some places, even this is probably an underestimate. See go.nature.com/pytbbu for more.

Chart: DRUG-RESISTANT TB SPREADS IN EASTERN EUROPE
In seven former Soviet countries, a fifth or more of new tuberculosis cases are now multi-drug resistant.
NEWS IN FOCUS


Tighter shackles loom for SARS and H5N1 research

Iron-dumping project unleashes a sea of troubles

Resignations expose tensions at Texas cancer institute

In political choices, does the body rule the mind?


President Barack Obama (left) and presidential candidate Mitt Romney often take opposing stances on government’s role in science and innovation. 


POLITICS


High stakes for US science 


As the election nears, the opinions of the presidential candidates diverge over research. 


BY JEFF TOLLEFSON AND HEIDI LEDFORD 


Science rarely takes the stage in US presidential campaigns. But as President Barack Obama and Republican challenger Mitt Romney make their final bids for votes on 6 November, their sharply contrasting visions of the size and proper role of government have profound implications for science. In the face of a sluggish economy and competition from countries such as China, “we've got to make sure that we've got the best science and research in the world”, Obama said during his second debate with Romney, on 16 October. Romney, previously a business executive and the former governor of Massachusetts, responded with a refrain that has become a centrepiece of his campaign: “Government does not create jobs.” The exchange captures differences that could directly affect a wide range of science-related matters for years to come, including research funding, energy development, environmental regulation and public health (see ‘In their own words’).

In his campaign rhetoric and in the policy decisions made during his first term as president (see Nature 487, 414-415; 2012), Obama has kept science near the centre of his economic plan for the United States. He has defended investment in education and in basic and applied research as essential, especially during times of austerity. By contrast, Romney has emphasized the need to scale back government and rein in the trillion-dollar deficits that have marked Obama’s first term. Romney's campaign affirms the federal government's importance in funding basic research, but asserts that commercial innovation belongs squarely in the private sector.

“These are really important signals,” says David Victor, a public-policy expert at the University of California, San Diego. “It certainly does not appear that a Romney administration is going to put spending on research and development and education at the centre of their priorities, whereas Obama has made science and education an important part of his agenda.”

For research advocates, Obama's stance puts him at a clear advantage. Despite generally tight budgets across federal granting agencies in recent years, “Obama has a strong track record” of investing in science, says Abby Benson, president of the Science Coalition, a non-profit organization based in Washington DC that represents 50 US research universities.
IN THEIR OWN WORDS

Barack Obama and Mitt Romney speak about science issues on the campaign trail.


CLIMATE

Obama: “Climate change is one of the biggest issues ... we have to meet this challenge by driving smart policies that lead to greater growth in clean energy generation and result in a range of economic and social benefits.”

Romney: “I oppose steps like a carbon tax or a cap-and-trade system ... Economic growth and technological innovation, not economy-suppressing regulation, is the key to environmental protection.”

The bottom line: Romney opposes all climate regulations. Obama used regulatory powers to push through energy and climate regulations, and subsidies for low-carbon energy technology.


EDUCATION

Obama: “Now I want to hire another hundred thousand new math and science teachers and create two million more slots in our community colleges so that people can get trained for the jobs that are out there right now.”

Romney: “I propose we grade our schools so parents ... can take their child to a school that’s being more successful. I don’t want to cut our commitment to education, I want to make it more effective and efficient.”

The bottom line: Obama often emphasizes science education. Romney focuses on streamlining the federal role in education while encouraging school choice and voucher programmes.


ENERGY

Obama: “I have supported an all-of-the-above energy approach that will allow us to take control of our energy future, one where we safely and responsibly develop America’s many energy resources.”

Romney: “A crucial component of my plan ... is to dramatically increase domestic energy production and partner closely with Canada and Mexico to achieve North American energy independence by 2020.”

The bottom line: Romney and Obama have tussled over who supports fossil fuels the most. The difference is that Obama continues to push to develop renewable sources for the long term.


INNOVATION

Obama: “I am committed to doubling funding for key research agencies to support scientists and entrepreneurs, so that we can preserve America’s place as the world leader in innovation.”

Romney: “The promotion of innovation will begin on Day One, with efforts to simplify the corporate tax code, reform job retraining programmes, reduce regulatory burdens, and protect American intellectual property.”

The bottom line: Obama deployed stimulus money to commercialize new technologies. Romney supports basic research, but leaves innovation and commercialization to the private sector.


REGULATION

Obama: “Smart rules can save lives and keep us safe, but there are some regulations that don’t make sense and cost too much.”

Romney: “We must reduce the power of unaccountable regulators by requiring that all major regulations receive congressional approval and by imposing a regulatory cap that prevents the addition of new regulatory costs.”

The bottom line: Romney would slow or stop regulation where possible. Obama moved to reduce unnecessary regulatory burdens, but also used regulatory powers to further his agenda.


SPACE

Obama: “Our goal is the capacity for people to work and learn and operate and live safely beyond the Earth for extended periods of time, ultimately in ways that are more sustainable and even indefinite.”

Romney: “America has enjoyed a half-century of leadership in space, but now that leadership is eroding ... I will bring together all the stakeholders ... and define the pathway forward.”

The bottom line: Under Obama, NASA has drifted into budgetary and mission limbo. Romney has underscored the problem but offers few specifics about what he would change.


STEM CELLS

Obama: “We will vigorously support scientists who pursue this research. And we will aim for America to lead the world in the discoveries it one day may yield.”

Romney: “I have a deep concern about curing disease ... but I will not create new embryos through cloning or through embryo farming, because that would be creating life for the purpose of destroying it.”

The bottom line: Obama repealed limits on federal funding of stem-cell research. Romney has not specified his stance, but his religious views may stand in opposition to current policy.
“Romney has talked about it, but we don’t know how that will translate into policy.”

On other issues, both candidates have shifted with the times. As governor of Massachusetts, Romney ushered in a ‘climate protection plan’ in 2004 that sought to reduce greenhouse-gas emissions while fostering a transition to cleaner energy sources. But one year later, he baulked at joining the Regional Greenhouse Gas Initiative, a cap-and-trade system covering carbon emissions from utilities in the northeastern part of the United States. This year, on the campaign trail, Romney has frequently pointed out that ‘global warming’ is not ‘America warming’, meaning that the United States should not take action (and increase costs on US manufacturers and businesses) while carbon emissions are accelerating in China and India. Romney has said that he would work to reverse greenhouse-gas regulations instituted by the US Environmental Protection Agency under Obama.


CLIMATE SWINGS 
For his part, after watching climate legislation wither in Congress in 2010, Obama has focused on increasing fossil-fuel energy production as well as boosting renewables. Much to the dismay of environmentalists, the Obama administration has supported shale-gas development and opened up new territory in the Arctic to drilling. Oil and natural-gas output are on the rise in the United States, which has translated into less reliance on foreign oil and a reduction in greenhouse-gas emissions as electric utilities switch from coal to natural gas.

Roger Pielke Jr, an environmental-policy expert at the University of Colorado, Boulder, says that Obama has been pushed so far towards the right by political realities in Washington that the core of his energy policy looks a lot like Romney’s. “There are plenty of people who like Obama because of what he believes,” Pielke says, “but if you are a policy pragmatist, what really matters is not what people feel in their heart, but what their actual policies are.”

Nonetheless, Romney has vowed to make clear turns away from current energy policy. He has said that he would reverse a century of public policy and turn the management of federal lands over to individual states to hasten energy production. He has also repeatedly questioned the wisdom of subsidies for renewable energy production. In particular, Romney has made it clear that he would end the kind of public-private ventures that resulted in government loans to the California solar energy firm Solyndra, which filed for bankruptcy last year despite US$535 million in federal loan guarantees.
Obama and Romney both say that they would bolster the scientific workforce through easier immigration policies for foreign-born graduates of university science and mathematics programmes. Yet Mary Woolley, president of the advocacy group Research!America, based in Alexandria, Virginia, frets that neither campaign has been specific enough about its commitment to biomedical research. “I think that’s concerning,” she says. “Experience shows that candidates who don't talk about an issue before they are elected are unlikely to be champions of that issue after they are in office.”


DOLLARS AND CELLS
Although Obama has a proven track record of support for biomedicine (he allocated $10.4 billion in economic stimulus funds to the National Institutes of Health (NIH) in 2009), he has flatlined the agency's funding in his latest budget request. Romney’s pledge to tighten government purse strings while boosting defence spending could force cuts to the NIH and to regulatory agencies such as the Food and Drug Administration, says Woolley.

Supporters of research that uses human embryonic stem (ES) cells are particularly nervous about what actions Romney might take if he becomes president. Under Obama's executive order of March 2009, which overturned restrictions imposed by the preceding George W. Bush administration, the number of human ES cell lines available to federally funded researchers has soared from around 20 to 182. Some fear that those gains may be frozen or even lost with Romney in the White House.

Romney’s statements about stem cells during the 2012 campaign have been vague enough to leave his policy intentions unclear, and the Romney campaign did not respond to Nature’s requests for clarification on the issue. But in 2011, Romney’s running mate, Paul Ryan, co-sponsored a bill that would guarantee every fertilized embryo the right to life. That, combined with policy statements from the Republican platform and the tone of Romney’s recent comments about abortion, “does not give us much reason to think he would support stem-cell research as president”, says Sean Tipton, vice-president of communications for the Coalition for the Advancement of Medical Research in Washington DC, an umbrella group that supports government funding of research on human ES cells.

Whoever wins the election on 6 November will inherit both a sputtering economic recovery and severe spending constraints. A combative and nearly evenly divided Congress could hamper new policy initiatives regardless of who is president, which means that tight races for the House and Senate will have significance beyond their local districts (see ‘Other votes to watch’). For US researchers, then, the future will depend not only on who wins the vote, but also on who can carry the momentum of an election victory into cooperative action across the political spectrum.

SEE EDITORIAL P.445


OTHER VOTES TO WATCH

During the US elections on 6 November, all 435 seats in the House, 33 seats in the Senate and 11 state governorships will be contested. In many cases, the outcomes could affect science-related policies.

[Map of the United States marking the states named below.]


BALLOT MEASURES

Agriculture: California could become the first state to require labelling of genetically modified (GM) foods, and a North Dakota measure could make planting GM crops a constitutional right.

Energy: A Michigan measure would require 25% of electricity sold in the state to come from renewable sources. A California vote could divert more state taxes to green energy.

Marijuana: Arkansas, Massachusetts and Montana will vote on allowing or expanding marijuana for medical use. Colorado, Oregon and Washington may opt to legalize it altogether.

Wildlife: Pre-empting challenges by animal-rights advocates, Idaho, Kentucky, Nebraska and Wyoming will vote on constitutional guarantees for hunting and fishing in those states.


TIGHT RACES
(D, Democrat; R, Republican)

California (10th District): Jeff Denham (R), who voted against greenhouse-gas regulation, faces former astronaut Jose Hernandez (D).

Florida (Senate): Two-term Senator Bill Nelson (D), a strong supporter of NASA, battles a challenge from Connie Mack IV (R).

Missouri (Senate): Claire McCaskill (D), a stem-cell champion, duels Todd Akin (R), who supports right-to-life for human embryos.

Montana (Senate): Denny Rehberg (R), tough on NIH officials during his tenure in the House, wants Jon Tester’s (D) Senate seat.

Washington (Governor): Jay Inslee (D), outspoken on climate, clean energy and environmental legislation, is aiming to lead the state.


Additional reporting by Helen Shen and 
Meredith Wadman. 




INFECTIOUS DISEASE 


Viral research 
faces clampdown 


US biosecurity concerns could harm collaboration. 


BY DECLAN BUTLER 


Federal health agencies in the United States have acted to tighten security surrounding research on two deadly pathogens. The move is intended to enhance public safety, but some fear that it may hamper research in the United States and abroad.

Earlier this month, US health agencies added the SARS virus to their list of select agents: pathogens and toxins that they deem to have the “potential to pose a severe threat to public health and safety”. In addition, the agencies last week opened a public consultation about whether they should designate strains of H5N1 avian influenza virus that are transmissible between mammals as ‘Tier 1’ select agents, a newly created class subject to the strictest levels of biosecurity. The consultation comes after the Intragovernmental Select Agents and Toxins Technical Advisory Committee advised the agencies that the virus could “potentially overwhelm the health care system”.

The listing of the SARS virus as a select agent, along with two recently discovered haemorrhagic-fever viruses, Lujo and Chapare, means that dozens of US labs holding the virus have until 3 April to either upgrade the biosecurity of their labs or destroy or transfer their stocks. The US government estimates that 38 of the labs (10 academic, 22 commercial and 6 government labs) do not currently have the extra biosecurity measures required to carry out select-agent research, including stricter lab-access rules and vetting of staff.

Shinji Makino, a SARS researcher at the University of Texas Medical Branch in Galveston, whose biosafety-level-3 (BSL-3) lab is one of the 38 affected, is “disappointed” by the move. His university owns the BSL-4 Galveston National Laboratory, which has the highest biosafety containment rating and so is select-agent compliant, and Makino is now seeking lab space there to continue his research.

The restrictions could interfere with sharing the virus or reagents between labs based in the United States or overseas, warns Michael Buchmeier, deputy director of the Pacific Southwest Center of Excellence for Biodefense and Emerging Infectious Diseases at the University of California, Irvine. “The threat of criminal prosecution and severe penalties will have a chilling effect on the kinds of collaborative efforts that have characterized SARS work up until now,” he says.

For H5N1 researchers, the proposal to add mammalian-transmissible forms of the virus to the Tier 1 list is the latest twist in a year-long international storm that erupted over experiments by Ron Fouchier at the Erasmus Medical Center in Rotterdam, the Netherlands, and Yoshihiro Kawaoka at the [...] that a few genetic tweaks allowed the avian virus to spread easily in ferrets, an animal model used for research into human flu. They say that if this virus escaped from a lab it might cause a pandemic in humans. In the face of these concerns, flu researchers agreed last January to a self-imposed moratorium on such work, which has yet to be lifted.

The US Department of Agriculture already lists H5N1 as a select agent on the basis of the risks to animal health. But labs handling Tier 1 select agents must take security precautions that go beyond the already stringent requirements for handling select agents. These include instituting extra physical barriers, video surveillance and stricter staff vetting and monitoring.

The Tier 1 subset of select agents was created following the FBI’s conclusion that a government scientist, Bruce Ivins, was responsible for the anthrax mailing attacks in 2001. It became effective on 5 October and includes the pathogens responsible for smallpox, Ebola, anthrax, foot-and-mouth disease and rinderpest (see ‘Top tier’).

The Department of Health and Human Services and the Centers for Disease Control and Prevention proposed adding some forms of H5N1 to the list on 18 October, and the public, as well as scientists, have 60 days to respond. If the extra restrictions are imposed, they are likely to influence the way the research is regulated globally, because the United States is the largest funder of flu work.

Not before time, claim the critics. Mammalian-transmissible H5N1 viruses definitely meet all the stated criteria for a Tier 1 select agent, given their potential deadliness, the lack of immunity in humans and the absence of sufficient drugs or vaccines to cope with a pandemic, says Richard Ebright, a molecular biologist and biodefence expert at Rutgers University in Piscataway, New Jersey.

But Fouchier argues that by creating a burden that would keep all but a few labs from working on the virus, listing H5N1 as a Tier 1 agent could ultimately make the world less secure. “We need as many labs to do research as possible, as this virus remains a direct threat to animal and public health,” he says. “Restricting the research on this pathogen will not increase global safety and security; it will make things worse.”


TOP TIER

The US Federal Select Agent Program this month created a new subset of 13 ‘Tier 1’ disease agents or toxins considered to be exceptional threats to security. Some forms of H5N1 may be added to the list.

Agent                                                  Type       Danger
Botulinum neurotoxin                                   Toxin      One of the most potent neurotoxins known
Ebola virus                                            Virus      Ebola haemorrhagic fever
Francisella tularensis                                 Bacterium  Tularaemia. Found in many animals, in particular rabbits
Marburg virus                                          Virus      Marburg haemorrhagic fever
Botulinum-neurotoxin-producing species of Clostridium  Bacterium  Toxin causes botulism food poisoning
Variola major virus                                    Virus      Most common form of smallpox
Variola minor virus                                    Virus      Rarer and less lethal form of smallpox
Yersinia pestis                                        Bacterium  Pneumonic and bubonic plague
Bacillus anthracis                                     Bacterium  Anthrax
Burkholderia mallei                                    Bacterium  Glanders in humans and horses
Burkholderia pseudomallei                              Bacterium  Melioidosis
Foot-and-mouth disease virus                           Virus      Foot-and-mouth disease
Rinderpest virus                                       Virus      Rinderpest


TECHNOLOGY 
Website pitches solutions in search of problems


Marblar aims to help scientists turn discoveries into innovative applications. 


BY RICHARD VAN NOORDEN 


to develop high-tech products are tap- 

ping into the wisdom of crowds to solve 
problems, with crowdsourcing sites such as 
Innocentive and Kaggle offering cash prizes 
for answers to science or data questions. The 
launch this week ofa site called Marblar (www. 
marblar.com) is turning this model on its head. 

Marblar gives scientists a space to tout solu- 
tions that have yet to find their problem. Mem- 
bers, who can come from any background, are 
invited to publicly discuss potential uses for 
patented discoveries made in research labora- 
tories that as yet may not have led to real-world 
applications. 

Sites that help to match up technology devel- 
opers with potential investors — such as Yet2. 
com, iBridgeNetwork.org and ninesigma.com 
— do so in private. By contrast, every sugges- 
tion at Marblar is posted on a public forum 
alongside video interviews with the scientists 
and explanations of their work. Website visi- 
tors suggest applications and vote them up and 
down, and the scientists behind the discovery 
are encouraged to take part in the discussion. 
Popular suggestions are recognized with a 
points system (denoted by marbles — hence 
the name) and, in some cases, small cash prizes. 

The venture has attracted £371,000 
(US$600,000) of backing from IP Group, a 
technology-investment firm based in Lon- 
don. The site charges listing fees and gets prizes 
sponsored; if it takes off, it may offer recruit- 
ment services, or even get a small slice of royal- 
ties from spin-off companies. 

The website is the brainchild of entrepreneur 
and biochemistry doctoral student Dan Perez at 
the University of Oxford, UK. Perez argues that 
scientists seeking commercial use for their work 
don't consult widely enough. “So much science 
lies fallow because people pursue the wrong ave- 
nues,’ he says, noting that the search often stops 
at the door of the university technology-transfer 
office, where inventions are pushed towards a 
few narrow applications that may not bear fruit. 
Marblar, he hopes, will throw that consultation 
process open to the world. 

“The idea addresses an important problem,’ 
says George Whitesides, a famously inventive 
chemist at Harvard University in Cambridge, 
Massachusetts, who is not involved with 
the company. “There is all manner of really
interesting science that has not been exploited.”
However, he points out that the cost of turning
an idea into a practical product is the biggest
barrier to commercialization. Furthermore,
the best applications are likely to have found
backers already. “Having said that, what this
kind of idea might do is to stimulate thinking
about really new and unconventional types
of applications and products,” he says.

Team Marblar: (from left) Mehmet Fidanboylu, Gabriel Mecklenburg, Daniel Perez and Tom von Erlach
hope that their website will open fresh pathways to success for inventions.

Perez says that Marblar users will initially be
discussing more than 30 technologies offered by
UK and US universities and research-funding
agencies. These include optical tweezers that 
can deform and manipulate nanoparticles; a 
device that can transmit multiple data signals 
through power lines; and a way of bringing 
molecules inside cells by hitching them to 
antibodies attached to cell-invading viruses. 

So will Marblar work? “Like all these things, 
the idea depends on the people involved more 
than anything else — and Dan is quite persua- 
sive,” says Graham Richards, a former head of 
chemistry at the University of Oxford, who has 
founded numerous spin-off firms and now sits on
the board of IP Group. Successful technology
transfer depends on wide networking with the
right people, adds Richards. Perez is
well connected: in June 2011 he founded an
industry discussion forum for young scientists,
which has now become the Oxbridge Biotech
Roundtable, and to which Marblar’s three
other young founders also belong.

NATURE.COM: Read more about Daniel Perez and Marblar: go.nature.com/Ganjmf

Marblar has already had a trial run, using 
work by chemist Tom Brown from the Uni- 
versity of Southampton, UK, which involves 
a method to chemically join DNA and RNA 
together without using biological enzymes. The 
pilot was fairly successful, says Adam Irvine, who 
works in the university's technology-transfer 
office. “I was sceptical that much would come 
back from this,” he says. “So it was interesting for
me to see that we could get useful ideas.” 

Contributors to Marblar are made aware 
that they are effectively giving up their ideas 
for free by discussing them in public, but Perez 
notes that beyond the rewards the site provides, 
there may be a future for winning entries. For 
example, the architect of the top suggestion in 
Marblar’s pilot challenge is now in talks with the 
inventor to start a company to develop drugs 
that bind to RNA. 

“It’s very difficult to measure how successful 
crowdsourcing websites have been — of course, 
they will all trumpet successes for clients,” says 
Michael Hill-King, who leads knowledge- 
transfer schemes at King’s College London. 
But, he says, “Marblar is trying to open up ques- 
tions about what you do with inventions — and 
I hope it will cause some waves”.
Ocean-fertilization project
off Canada sparks furore 


Bid to boost salmon stocks relied on hotly debated science and dubious carbon credits. 


BY JEFF TOLLEFSON 


When a chartered fishing boat strewed
100 tonnes of iron sulphate into the
ocean off western Canada last July,
the goal was to supercharge the marine ecosys- 
tem. The iron was meant to fertilize plankton, 
boost salmon populations and sequester car- 
bon. Whether the ocean responded as hoped 
is not clear, but the project has touched off an 
explosion on land, angering scientists, embar- 
rassing a village of indigenous people and 
enraging opponents of geoengineering. 

The first reports about the project, which 
appeared in British newspaper The Guardian 
on 15 October, presented it as a rogue geo- 
engineering scheme — the largest in history 
— in “blatant violation” of international trea- 
ties. Critics suggested that Russ George, a US 
entrepreneur, had persuaded the Haida Nation 
village of Old Massett on the Queen Charlotte 
Islands to fund the project by promising that 
it would be possible to sell carbon credits for 
the carbon dioxide taken up by phytoplankton. 

The reality was much more complex, and 
it underscores the combustible politics and 
uncertain science of geoengineering. 

Contacted by Nature, George lashed out at 
the media and “radical environmentalists” for 
manufacturing a “racist” story about a maverick 
geoengineer taking advantage of naive natives. 
“This was their work and their project,” he says. 
“It is not the result of them being too stupid to
know better.”

It is now clear that Old Massett, a fishing 
village of fewer than 1,000 people, embraced 
the project in hopes of restoring dwindling 
salmon runs by boosting phytoplankton and, 
in turn, the entire marine food web. Villagers 
voted in February 2011 to lend Can$2.5 mil- 
lion (US$2.5 million) to the Haida Salmon 
Restoration Corporation (HSRC) to fertilize 
the ocean, says John Disney, head of the Old 
Massett-based corporation and economic- 
development officer for the village. George, 
who previously headed Planktos, a firm based 
in San Francisco, California, that had sought to 
commercialize ocean fertilization using iron, 
signed on as chief scientist after the HSRC 
approached him, says Disney. The company 
planned to repay the village for its loan by sell- 
ing carbon credits to companies seeking to 
offset their greenhouse-gas emissions, he adds. 


Workers on a Haida Salmon Restoration Corporation boat release iron sulphate into the Pacific Ocean. 


“We created life where there wasn't life,” says
Disney, adding that the fertilization fed a phyto- 
plankton bloom of some 10,000 square kilome- 
tres, which attracted fish, birds and whales (see 
‘Sowing controversy’). “The only difference 
between what we've done and what everybody 
else has done is that we've taken it up a notch.”

In fact, the Old Massett scheme dumped 
five times more iron than previous fertiliza- 
tion experiments. And no scientists outside 
the project have seen data that might show 
whether it worked as advertised. “I’m not 
going to condemn it offhand, but this is just 
not the way to do this experiment,” says Victor
Smetacek, a marine biologist with the
Alfred Wegener Institute for Polar and Marine
Research in Bremerhaven, Germany. “It’s quite
sophisticated science, and it would have been
good if scientists had carried it out.”

SOWING CONTROVERSY
A company backed by a Canadian indigenous
group has attempted to fertilize a region of the
Pacific Ocean important for salmon stocks.
[Map labels: Canada; Old Massett; site of ocean fertilization; United States]

The project was also on uncertain legal 
grounds. Ocean fertilization is restricted by a 
voluntary international moratorium on geo- 
engineering, as well as a treaty on ocean pol- 
lution. Both agreements include exemptions 
for research, and the treaty calls on national 
environment agencies to regulate experiments. 
Officials from Environment Canada say that 
the agency warned project leaders in May that 
ocean fertilization would require a permit. 

“Environment Canada did not approve this 
non-scientific event,” environment minister 
Peter Kent told Parliament on 18 October. 
“Enforcement officers are now investigating.”
The Canadian National Research Council gave 
nearly Can$70,000 in funding to the HSRC, 
and the US National Oceanic and Atmospheric 
Administration provided 20 buoys to help to 
monitor water conditions. But officials at those 
agencies say they were never informed of the 
ocean-fertilization project, and they thought 
that the work involved salmon ecology. 

Jason Blackstock, a geoengineering expert at 
the University of Oxford, UK, says that the situ- 
ation highlights the grey area between geoengi- 
neering to alter global climate, and local actions 
with other goals such as boosting salmon stocks 
or seeding clouds for weather modification. 
“This has the potential to become a ubiquitous
problem,” says Blackstock.

The ETC Group, an advocacy organiza- 
tion based in Ottawa that has led a global 
drive against geoengineering, has sug- 
gested that George misrepresented the pro- 
ject’s potential to generate carbon credits. 
Documents from the Old Massett website 
imply that in raising funds, project lead- 
ers stressed the potential for easy carbon 
credits. One document, tied to a 2011 loan 
application, showed that bank managers 
were wary of the HSRC’s claims that the 
market for such carbon offsets was proven 
and that “retail outlets and banks in Ger- 
many are begging for the product”. 

In fact, carbon credits from iron-fertili- 
zation projects cannot be offered on formal 
markets such as the European emissions 
trading system, although willing buyers 
might be found outside those markets. And 
whether iron fertilization actually sequesters
carbon is uncertain. A study¹ by Smetacek
published in July, based on analysis of
an experiment in 2004, found that at least
half of the carbon taken up by the iron-fertilized
plankton was buried after they sank
to the bottom of the sea. But other studies²
have found that carbon in the blooms
remains in the active biological cycle and is 
not sequestered at all. 

George says that the bankers were ulti- 
mately satisfied, and that carbon credits are 
no more than a possible source of future 
funding, if the science supports them. But 
in an initial interview, Disney repeatedly 
said that the company needs to sell the car- 
bon credits quickly to repay its loan from the 
community. “Being the guy who sold this to 
the community, I bloody well better come 
up with the money,” he said. Disney later 
backed off his emphasis on carbon credits
and stressed that he stands by George. 

It is unclear whether the project will 
restore the salmon. A bumper run of sock- 
eye salmon (Oncorhynchus nerka) in 2010 
came two years after a volcanic eruption in 
Alaska sent a layer of iron-rich ash over the 
ocean, fertilizing a plankton bloom³. But
many scientists remain sceptical. 

Whether the Haida experiment worked 
won’t be known for two years, when the
youngest of the salmon feeding in the ocean 
today return home to spawn. John Nightin- 
gale, president of the Vancouver Aquarium 
in Canada, says that will be a chance to 
glean some science from the project. The 
work may have lacked scientific rigour, 
he says, but the HSRC has now agreed to 
make all of its data available to scientists. It 
has “done something unique”, Nightingale
says. “I want the maximum information, the 
maximum analysis, the maximum debate.”


1. Smetacek, V. et al. Nature 487, 313-319 (2012).
2. Boyd, P. W. et al. Nature 407, 695-702 (2000).
3. Jones, N. Nature http://dx.doi.org/10.1038/news.2010.572 (2010).
William Gimson of the Cancer Prevention and Research Institute of Texas is on the hunt for a chief scientist.


Texas cancer fund 
seeks fresh start 


Critics question whether institute has resolved conflicts 
between commercial and scientific goals. 


BY MONYA BAKER 


It must be hard filling a position when the
last person in the job was a Nobel laureate
who quit in protest over a disregard for
peer review, and whose departure triggered an
avalanche of other resignations. Nevertheless,
the Cancer Prevention and Research Institute
of Texas (CPRIT) is confident that it will
announce a chief scientific officer to replace
Alfred Gilman, who departed on 12 October, by
the end of the year. But institute watchers
suspect that even an ideal candidate might not be
able to dispel the science community's unease
about the CPRIT’s attempt to simultaneously
support basic research and nurture companies.

“I can’t think of a better example than this
one of how a potential conflict of interest can
undermine an institution,” says Paul Root
Wolpe, a bioethicist at Emory University in
Atlanta, Georgia.

The Austin-based CPRIT was created in 2007
when Texas voters agreed to a US$3-billion
initiative that would spend $300 million a year
to advance basic research, reduce cancer rates 
and nurture Texas companies. Since then, the 
state agency has awarded 427 grants totalling 
more than $750 million, with $574 million des- 
ignated for scientific research and the rest for 
commercialization and prevention. Its funding
of innovative research has won accolades.
Controversy erupted in May after Gilman, 
who won the 1994 medicine Nobel, tendered 
his resignation in a strongly worded letter 
criticizing a $20-million commercial ‘incu- 
bator’ grant that had been awarded without 
scientific review. Much of the grant was slated
for a group led by Lynda Chin at the Univer- 
sity of Texas MD Anderson Cancer Center 
in Houston, where Chin’s husband, Ronald 
DePinho, is president. 


CPRIT internal correspondence that was
subsequently made public through
freedom-of-information rules
suggests that the grant criteria were tailored
to improve Chin’s eligibility (see Nature 486,
169-170; 2012). At the same time as the incubator
grant was awarded, a set of grants recommended
for approval by the CPRIT’s scientific
council stalled.

NATURE.COM: Read an interview with a former CPRIT scientific reviewer: go.nature.com/rxa4y!

Gilman said that he would remain with the 
CPRIT until the autumn. Over the summer, 
the contentious $20-million grant was withdrawn
for ‘re-review’, and provisions were
made for scientific review of all commercial 
grants. The previously sidelined grants were 
approved (as were all grants recommended by 
reviewers), and a compliance officer was hired 
to prevent submission irregularities. 

But the measures to uphold peer review have 
not won over the CPRIT’s critics, and many 
scientific reviewers, including at least seven 
of the eight heads of the institute's external 
scientific review panels, decided to depart 
along with Gilman. Their resignation letters 
accused the CPRIT of “hucksterism” and of
having “dishonored” the system of peer review. 
They also complained of unusual circumstances 
faced by reviewers evaluating commercial 
proposals, saying that CPRIT staff sent them 
additional information and invited them 
to revise unfavourable scores. The resigna- 
tion letters said that concerns they expressed 
were ignored, and Gilman himself suggests 
that CPRIT officials have been biding their
time, “waiting for me to get out of the way”.

An e-mail from Jimmy Mansour, a Texas 
businessman who heads the CPRIT’s govern- 
ing board, to his fellow committee members 
said that he looked forward to the appointment 
of MD Anderson's provost, Raymond DuBois, 
in Gilman’s place. “Gilman is gone and so is
his influence,” Mansour wrote last week in an
e-mail made public by the Houston Chronicle
on 18 October. Mansour later told Nature that
Gilman’s peer-review system was a “cornerstone”
of funding decisions. DuBois, for his part,
says that he has not accepted a position at the
CPRIT and is instead taking up a role at
Arizona State University in Tempe.

William Gimson, the CPRIT’s executive 
director, agrees that there were “process prob- 
lems” with the grants, but denies more seri- 
ous underlying issues. The concerns “mostly 
boil down to the natural conflict that exists 
between our scientific and commercialization 
portfolios”, he says.

Such conflicts can undermine an institution,
says Hamilton Moses at Alerion Advisors,
a consulting firm for research institutes
in North Garden, Virginia. “There are very
few examples I can think of where adequate
review has occurred when there has been an
economic-development agenda.”

Wolpe says that the CPRIT needs to take 
explicit action to restore trust between its 
overseers and its scientific reviewers. “The 
quickest way they can come back is through 
the resignation of the director,” he says, a move
that would signal a commitment to change. 
“Barring that, they need to put in a fully inde- 
pendent external review of their processes and 
procedures. And they need to announce very 
clear remedial measures.” Gimson says that he 
has no plans to resign and that concerns about 
scientific integrity are unfounded. 

Garnet Coleman, a member of the Texas 
House of Representatives, believes that the 
CPRIT’s problems originated with the legisla- 
tion that created the institute, which he says 
lacks appropriate internal safeguards and 
oversight by the legislature. “It was like some- 
body didn't know how to write the creation of 
a state agency, but they did anyway,” he says. 
“We need to take the rules, fix them and put 
them in statute.” He hopes to introduce a new 
measure next year. 

Gilman, now retired, says that if the CPRIT
cannot fund the best science, “I would person- 
ally rather see the money not spent and go back 
to the taxpayers”.
SEE EDITORIAL P.446
THE LEARNING CURVE


Researchers say that some chemicals have unexpected and potent 
effects at very low doses — but regulators aren’t convinced. 


BY DAN FAGIN 


Near the end of an adventurous life spent wandering the
fortress towns of central Europe, clashing with bloodletters
and other tradition-bound healers of the day, the
irascible sixteenth-century physician Paracelsus wrote a
defence of his unorthodox use of mercury, opium and other potentially 
dangerous medicines. “All things are poison, and nothing is without 
poison: the dose alone makes a thing not poison,” he wrote. Centu- 
ries later, after many of his once-radical ideas found wide acceptance, 
Paracelsus's pronouncement would be distilled into a pithy phrase that 
became foundational dogma for the modern science of toxicology: 
“the dose makes the poison.” 

The contemporary interpretation of Paracelsus's famous declaration, 
for which he is often called the father of toxicology, is that dose and 
effect move together in a predictably linear fashion, and that lower 
exposures to a hazardous compound will therefore always generate 
lower risks. This idea is not just a philosophical abstraction; it is the 
core assumption underlying the system of chemical-safety testing that 
arose in the mid-twentieth century. Risk assessors typically look for 
adverse effects of a compound over a range of high doses and, from 
there, extrapolate downwards to establish health standards — always 
assuming, like Paracelsus, that chemicals toxic at high doses are much 
less risky at lower, real-world levels. 
But what if the Paracelsian presumption is wrong? What if, for a
large and potent class of compounds, lower doses pose higher risks? 
A growing number of academic researchers are making just such a 
claim for endocrine disrupters, a large group of synthetic chemicals 
able to interact with cellular hormone receptors. These compounds, 
which range from the common weed killer atrazine and the plasticizer 
bisphenol A (BPA) to the antibacterial agent triclosan (used in cleansers)
and the vineyard fungicide vinclozolin, don’t play by the usual
rules of toxicology. On the basis of conventional high-dose testing, 
regulators have set maximum acceptable levels for each of them that 
assume all doses below that level are safe. But academic researchers 
who have studied a wider range of doses, including very low ones 
found in the everyday environment, say that their experiments usu- 
ally do not generate the tidy, familiar ‘ski-slope’ dose-response graphs 
of classic toxicology. Instead, most endocrine disrupters have ‘non- 
monotonic’ dose-response curves, meaning that their slopes change 
at least once from negative to positive, or vice versa, forming ‘U’
shapes, inverted ‘U’s or even stranger shapes that resemble undulating
Chinese dragons (see ‘Curious curves’).
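
The definition of a non-monotonic curve given above can be checked mechanically. What follows is a minimal Python sketch, not drawn from any of the studies discussed here: the function names `count_slope_reversals` and `is_non_monotonic` and all of the sample response values are hypothetical, chosen only to illustrate the shapes described.

```python
def count_slope_reversals(responses):
    """Count how often the slope between consecutive points changes sign.

    `responses` holds measured effects ordered by increasing dose.
    Flat segments (no change between two doses) are ignored.
    """
    signs = []
    for previous, current in zip(responses, responses[1:]):
        change = current - previous
        if change != 0:
            signs.append(1 if change > 0 else -1)
    return sum(1 for a, b in zip(signs, signs[1:]) if a != b)


def is_non_monotonic(responses):
    """A curve is non-monotonic if its slope reverses at least once."""
    return count_slope_reversals(responses) >= 1


# Hypothetical, illustrative values only (not data from any experiment).
classic_curve = [1, 2, 4, 7, 11]   # effect always rises with dose
u_shaped = [5, 3, 1, 3, 5]         # falls, then rises
inverted_u = [1, 3, 5, 3, 1]       # rises, then falls
dragon = [1, 4, 2, 5, 3]           # several reversals

for name, curve in [("classic", classic_curve), ("U", u_shaped),
                    ("inverted U", inverted_u), ("dragon", dragon)]:
    print(name, count_slope_reversals(curve), is_non_monotonic(curve))
```

Counting sign changes is a deliberately crude test. With real measurements, noise alone can produce apparent reversals, which is why a statistical comparison of curve fits would be needed before calling a response non-monotonic.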

“We're seeing that for every one of these compounds we test, there 
will be a non-monotonic response — every one!” says Frederick vom
Saal, a neurobiologist at the University of Missouri-Columbia, who 
has been sounding the alarm about endocrine disrupters since the 
1970s. “Low doses of endocrine disrupters act in ways that are totally 
unpredicted by the traditional approaches of toxicology.” Vom Saal
and his colleagues believe that very low doses of these compounds in
the environment are contributing to a wide range of human health
problems, including obesity, diabetes, cancer, cardiovascular disease,
and infertility and other disorders related to sexual development.

Many toxicologists, however, are not convinced, especially those in
industry or government who have spent their careers deeply involved in
traditional risk assessment. Although they acknowledge that endocrine
disrupters have unusual toxicological quirks, they say that the work of
vom Saal and like-minded researchers is still insufficiently replicated,
too reliant on unvalidated assays and too focused on end points such as
organ weight, precancerous growths and changes in the activity of genes
and proteins, which may not pose significant health threats. “If we're
going to take this seriously, we need to have some evidence of a real
phenomenon that happens not just in the hands of one researcher and
one test, something repeatable that can stand up to scientific scrutiny
about how it could lead to real health effects we want to avoid,”
says Lorenz Rhomberg, a toxicologist
at the environmental-consulting firm
Gradient in Cambridge, Massachusetts.
Rhomberg also serves as a consultant on
endocrine disrupters to the American
Chemistry Council, an association of
chemical manufacturers.

Vom Saal and his colleagues counter 
that this is precisely the type of sys- 
tematic evidence they can now pro- 
vide, thanks to a boom in endocrine-disrupter research. The most 
comprehensive review yet of the field¹, published in March, included
more than 600 studies — almost half of them published within the 
past five years — and found credible evidence of non-monotonic 
responses with low-dose health effects in 18 endocrine disrupters, 
including BPA, atrazine and vinclozolin. “We kept hearing from our 
critics that there aren't enough examples to prove this phenomenon 
is real, so we took that as a challenge,” says Laura Vandenberg, a post- 
doctoral fellow at Tufts University in Medford, Massachusetts, and 
the lead author of the review. 

Government officials in Europe and the United States are paying 
attention. “I find the Vandenberg review to be quite compelling and 
quite convincing,” says Linda Birnbaum, director of the US National
Institute of Environmental Health Sciences (NIEHS) in Research
Triangle Park, North Carolina. In an April editorial in the NIEHS- 
published journal Environmental Health Perspectives, Birnbaum 
argued that “it is time to start the conversation” about incorporating 
low doses and non-monotonic relationships into regulatory decisions².
At a European Commission scientific conference on endocrine
disrupters this June in Brussels, delegates failed to reach a consensus 
on the importance of non-monotonicity at low doses but they did 
agree that existing regulations need to be stricter, according to Bjorn 
Hansen, who heads the chemicals unit at the commission's Directo- 
rate-General for Environment in Brussels. In the United States, mean- 
while, the Environmental Protection Agency (EPA) and the Food and 
Drug Administration (FDA) are showing a new willingness to at least 
discuss the issue — although they say that major regulatory revisions 
are not on the table for now. 

Big changes are unlikely, some observers suggest, as long as the 
field remains so polarized. “There is a very large divide out there 
between the risk assessors and the endocrine scientists,” says Thomas
Neltner, who studies chemical food additives at the non-profit Pew
Health Group in Washington DC, which has been trying to arrange
a rapprochement through a series of scientific meetings. “Our feeling
is that the two sides have been talking past each other.”


EARLY SIGNS 

For as long as there has been controversy over the effects of endocrine 
disrupters, vom Saal has been at the centre of it. Lean and intense, the 
67-year-old native New Yorker is an amateur pilot who flies his Cessna 
to scientific conferences and is not shy about tangling with his critics 
wherever he lands. As a postdoctoral fellow at the University of Texas at 
Austin in the 1970s, vom Saal was startled to discover that subtle vari- 
ations in sex-hormone levels in the womb could have life-long effects 
on mice. A female mouse positioned between two males in the uterus 
would later, as an adult, display significantly more ‘masculine’ characteristics,
such as aggression, than would a female surrounded prenatally
by other females, vom Saal found³. The apparent cause: a minuscule
amount of extra testosterone released by the neighbouring male fetuses. 

Experimenting first with natural hormones and the synthetic oestro- 
gen diethylstilbestrol (DES), vom Saal found that male mice exposed 
prenatally to very low levels of DES devel- 
oped heavier prostates than unexposed 
mice — making them more vulnerable 
later in life to prostate disease, includ- 
ing cancer. Strangely, however, he found 
that higher doses of DES did not trigger 
the same effect⁴. It was one of the first
non-monotonic dose-response curves 
mapped for an endocrine disrupter. 
Since then, vom Saal and his Missouri 
colleague Wade Welshons have identified 
similar non-monotonic responses from 
a variety of endocrine disrupters, most 
notably BPA⁵, a ubiquitous ingredient of
polycarbonate plastics and epoxy coat- 
ings, including in food packaging. 

Vom Saal’s early work helped to generate a torrent of international 
interest in BPA, including an ultimately successful campaign by activ- 
ists in the United States, Canada and some European countries to 
halt its use in baby bottles and toddlers’ sippy cups. It also helped to 
inspire a legion of researchers to look for — and often find — other 
endocrine-related effects in animals exposed to very low levels of BPA 
and other hormone mimics. At Tufts, for example, cell biologist Ana 
Soto won notoriety for discovering that early exposure to BPA can alter 
the development of mammary glands in mice and rats, spurring the 
growth of oestrogen receptors and leading to precancerous growths 
and non-invasive cancers⁶. In Spain, another cell biologist, Angel
Nadal of Miguel Hernandez University in Elche, exposed human 
pancreatic cells to BPA and mapped non-monotonic relationships
between dose levels and altered glucose metabolism, a key risk factor
for diabetes and obesity⁷. Epidemiologists jumped into the fray, too.
They found associations between urine BPA levels and obesity in children⁸,
and linked other endocrine disrupters to incidence of diabetes⁹.

Their studies, and many others, depict a weird world of endocrine 
disruption that is as different from traditional toxicology as quantum 
mechanics is from the staid clockwork of Newtonian physics. When 
even minuscule quantities of BPA and other disrupters interact with 
hormone receptors at crucial moments in development — activating, 
jamming, hijacking or otherwise messing with their normal func- 
tion — they can give rise to strange-looking experimental results, 
especially when other hormones are thrown into the mix. 

At the University of Illinois at Chicago, for example, reproductive 
physiologist Gail Prins implants prostate-like glands, grown from 
human stem cells and mixed with mouse tissue, into young mice, 
and then feeds some of them very low doses of BPA. As the mice age, 
Prins also gives them low doses of oestradiol, a naturally occurring 
hormone that becomes more potent in human males as they age and 
is a known risk factor for prostate cancer. Between 35% and 40% of 
the mice fed BPA plus oestradiol have developed prostate cancer, 
compared with 10% of the mice that were given oestradiol only. Her 
working theory is that BPA binds with oestrogen receptors in prostate 
stem cells, reprogramming genes in ways that leave the cells more 
sensitive to oestradiol later on. “What’s remarkable,” she says, “is that
we're working with very low doses that are definitely within the range 
of normal human exposure.” She plans to publish her results in early 
2013, as soon as she has collected more data. 

The interplay of these types of receptor mechanisms can generate 
bizarre dose-response relationships, many of which are still being 
mapped out. Earlier this month, vom Saal’s group at Missouri pub- 
lished the first full non-monotonic dose-response curve for the widely 
used plastics ingredient known as DEHP, 
or di(2-ethylhexyl) phthalate¹⁰. The Missouri
team subjected 78 pregnant mice
to an extremely wide range of DEHP 
doses — from 0.5 micrograms per kilo- 
gram of body weight per day all the way 
up to 500,000 micrograms. They found 
that the animals’ testosterone levels rose 
or fell in surprising ways, altering sexual 
development depending on the dose 
received. For male offspring, for exam- 
ple, the dose-response curve looked 
something like the profile of a craggy 
mountain. Serum levels of testosterone 
rose between the 0 and 0.5 microgram 
dose levels, then fell slightly at 1, rose 
again through 5 and 500 before declin- 
ing again at 50,000 and plunging at the 
500,000-microgram dose level. The testosterone
level at the highest dose, in fact, was virtually
identical to that of unexposed controls. Seeking
to deflect another potential line of attack from his critics, vom Saal
conducted a ‘goodness of fit’ statistical analysis confirming that a non- 
monotonic curve best fits his data. 
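
As a rough illustration of why this curve counts as non-monotonic, the sketch below encodes only the directions of change reported above for male offspring (rose, fell, rose, rose, declined, plunged). The article gives no testosterone magnitudes, so none are assumed; the names `dose_steps` and `slope_reversals` are hypothetical, and this is not the ‘goodness of fit’ analysis that vom Saal carried out.

```python
# Direction of change in serum testosterone at each step up the dose range
# (micrograms per kilogram of body weight per day), as described in the text.
# +1 means the level rose, -1 means it fell. Magnitudes are not reported
# in the article, so only the signs are encoded.
dose_steps = [
    ("0 -> 0.5", +1),
    ("0.5 -> 1", -1),
    ("1 -> 5", +1),
    ("5 -> 500", +1),
    ("500 -> 50,000", -1),
    ("50,000 -> 500,000", -1),
]


def slope_reversals(steps):
    """Count the points at which the direction of change flips."""
    signs = [sign for _, sign in steps]
    return sum(1 for a, b in zip(signs, signs[1:]) if a != b)


reversals = slope_reversals(dose_steps)
print(f"slope reversals across the dose range: {reversals}")
```

A monotonic curve would show no reversals at all. The sequence described here flips direction more than once, which matches the ‘craggy mountain’ profile in the text.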

The biochemical mechanism behind the strange DEHP curve is 
unknown — vom Saal says that it awaits further study — but research- 
ers have worked out the specific causes of other non-monotonic 
curves. One of the best understood examples involves not a pollut- 
ant, but a drug: the chemotherapy agent tamoxifen, which binds to 
oestrogen receptors in breast cells and has a dose-response curve 
shaped like an upside-down U. Very low doses have little effect on 
cancer cells, but as the drug builds up in breast tissue it actually stimu- 
lates tumour growth, triggering a painful ‘flare’ period for patients. 
When tamoxifen levels grow high enough to occupy most of the 
available oestrogen receptors, the effect reverses and the drug begins
to inhibit the cancer cells’ growth. “All of this is very well known to
endocrinologists,” says Thomas Zoeller of the University of Massachusetts
Amherst, who studies the effects of endocrine disrupters on
the thyroid. “Non-monotonic dose-response is a fact of life.”

A turning point was a 2009 scientific statement by the Endocrine 
Society in Chevy Chase, Maryland, the first in its 95-year history. It 
called endocrine disrupters a “significant concern to public health”;
endorsed stricter regulations; acknowledged non-monotonic responses; 
and declared that “even infinitesimally low levels of exposure — indeed, 
any level of exposure at all — may cause endocrine or reproductive 
abnormalities”¹¹. Seven other scientific bodies then joined the Endocrine
Society in a letter of concern published last year in Science¹². “That's
as mainstream as you can get. It really changed the nature of the game,” 
says another long-time researcher in the field, molecular biologist Bruce 
Blumberg of the University of California, Irvine. 


PUT TO THE TEST 

Critics, however, say that the mere existence of non-monotonic 
responses and low-dose effects is not the point. What matters is the 
extent of the health concerns they raise. “There is non-monotonicity, 
but the question is, is it toxicologically relevant?” asks Jason Aungst, 
a supervisory toxicologist with the FDA's Center for Food Safety and
Applied Nutrition in Silver Spring, Maryland. He and a senior toxi- 
cologist at the EPA, Earl Gray, argue that the low-dose health effects 
identified in studies by vom Saal, Soto and others are still relatively 
rare and have not been conclusively linked to major health prob- 
lems. Low-dose effects that are more clearly harmful, such as organ 
deformities, are usually monotonic, and can be identified under cur- 
rent regulatory testing protocols, according to Gray. “You could never 
say that non-monotonicity doesn’t happen, but as far as its relevance 
to risk assessment, we really haven't seen it in the high-quality stud- 
ies,” he says. 

The schism in the field is in part a 
result of the different types of tests that 
academic researchers and risk assessors 
carry out. Many of the private testing 
labs hired by manufacturers seeking reg- 
ulatory approval for new products are 
not equipped to do the radioimmuno- 
assay analyses required to measure 
extremely low chemical concentrations. 
Nor do private labs typically look for the 
complex biochemical changes, such as 
alterations in protein levels, that are now 
part of the standard tool kit of research- 
ers such as Zoeller, Soto and vom Saal. 
Instead, agency-mandated guideline 
tests are standardized; involve simpler 
assays that are easier to replicate; use 
higher numbers of test animals; and 
generally seek to identify more obvious 
health problems, such as acute toxicity, 
cancer and physical deformities. “We do validated studies, the basic 
researchers don't,” says Rochelle Tyl, a developmental toxicologist
at RTI International, a firm in Research Triangle Park that conducts 
research for clients such as regulatory agencies and chemical manu- 
facturers. “That doesn’t mean they’re wrong, it just means they're 
doing work that hasn't been validated.” 

Yet even when government and industry-hired scientists look for 
low-dose effects, they often don't find them. For example, Tyl (who
conducted her studies for two industry groups) and Gray have each
tested BPA at very low doses without finding the potent developmen- 
tal effects identified by vom Saal, Prins and others. Vom Saal and 
his allies counter that the Tyl and Gray studies were insensitive to 
low-dose effects because their positive-control animals, which were 
given oestradiol alone, received doses that were much too high. The
dispute has been harsh and public, playing out at conferences and in
a seemingly endless series of sharply worded journal articles, rebuttals,
counter-rebuttals and counter-counter-rebuttals. The arguments
became so heated that Tyl stopped doing BPA work. “I gave up BPA
when it ceased to be scientific and became personal,” she says. “It
became flag-waving and it became political.”

Researchers have found that many endocrine-disrupting chemicals do not generate the standard monotonic dose-response curves seen for other types of compound.

MONOTONIC CURVE

In some cases, dose and response increase together. The plant oestrogen genistein, for instance, causes the mouse uterus to increase in weight.

NON-MONOTONIC CURVES

Mice exposed to moderate doses of bisphenol A develop the largest tumours. Moderate and high doses are thought to induce tumour-cell proliferation, but high doses also trigger cell death.

Largely because of the Tyl and Gray studies, neither the FDA nor the 
EPA has altered its risk assessments for BPA. The FDA still says that
BPA has no adverse effects at levels below 50 milligrams per kilogram 
of body weight per day — a level that vom Saal contends should actu- 
ally be two million times lower, at 25 nanograms. Both agencies, how- 
ever, are now cooperating on a much larger study designed to settle the 
dispute. The newly launched US$20-million study, led by the NIEHS
and the FDA's National Center for Toxicological Research, is the most
ambitious effort ever to look for non-monotonic dose-response curves 
that include very low doses. Last month, researchers began hand- 
feeding BPA to about a thousand rats at five dose levels ranging from 
2.5 micrograms per kilogram of body weight up to 25,000 micrograms, 
plus two positive-control groups (which received much lower oestra- 
diol doses than either Tyl or Gray used) and an unexposed control 
group. Vom Saal, Zoeller and other academics will be participating in 
the tissue analysis, allowing them to look for an array of health effects, 
such as metabolic changes in the prostate and mammary glands, that 
go well beyond those in standard regulatory protocols. 
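
The dose figures quoted above can be checked with a few lines of arithmetic. The sketch below is only an illustration: the variable names are hypothetical, and the only inputs are the numbers given in the text (50 milligrams, 25 nanograms, and a study range of 2.5 to 25,000 micrograms, all per kilogram of body weight).

```python
# Back-of-the-envelope check of the dose figures quoted in the text.
# All doses are per kilogram of body weight, held as whole nanograms
# so that the arithmetic stays exact. Names are illustrative only.
NG_PER_MICROGRAM = 1_000
NG_PER_MILLIGRAM = 1_000_000

fda_level_ng = 50 * NG_PER_MILLIGRAM   # FDA: no adverse effects below 50 mg
vom_saal_level_ng = 25                 # vom Saal's proposed level: 25 ng

# How many times lower is vom Saal's proposed level?
fold_lower = fda_level_ng // vom_saal_level_ng

# Span of the new study's BPA doses: 2.5 micrograms up to 25,000 micrograms.
study_low_ng = 2_500                   # 2.5 micrograms
study_high_ng = 25_000 * NG_PER_MICROGRAM
study_span = study_high_ng // study_low_ng

# How far the study's lowest dose sits above vom Saal's proposed level.
low_dose_vs_vom_saal = study_low_ng // vom_saal_level_ng

print(f"vom Saal's level is {fold_lower:,} times lower than the FDA's")
print(f"study doses span a {study_span:,}-fold range")
print(f"lowest study dose is {low_dose_vs_vom_saal:,} times vom Saal's level")
```

On these figures the gap between the two thresholds is indeed two million-fold, and the study's doses cover a 10,000-fold range.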

But with the results of the BPA mega-study not expected for at 
least five years, a major rewriting of chemical regulations to take 
non-monotonic low-dose effects into account still seems distant. 
The European Commission has to meet a self-imposed deadline 
of December 2013 to draft the first governmental criteria defin- 
ing endocrine disruption, but without a scientific consensus on the 
issue the criteria may end up addressing only high-dose effects, 
predicts Andreas Kortenkamp, a toxicologist at Brunel University 
in Uxbridge, UK, who has been advising the European Commis- 
sion. In the United States, meanwhile, the EPA and FDA have con- 
vened a joint working group to review the evidence accumulating 
in the peer-reviewed literature, but “not only is the jury still out, the 
jury hasn't even had a chance to look at the evidence yet”, says Rita
Schoeny, a senior science adviser at the EPA in Washington DC. 

Some veterans of the field have decided not to wait, and are 
collaborating on what amounts to an effort to bypass the regulatory 
system. They have written a paper, scheduled to be published in
January in the journal Green Chemistry, that gives industrial chem- 
ists detailed advice on how to screen newly synthesized compounds 
for endocrine-related effects, how to test at very low doses and how 
to look for non-monotonic dose-response curves. The paper, and a
companion website, are aimed at trying to head off potentially harm- 
ful endocrine disrupters before they reach the marketplace. 

An ancillary benefit, vom Saal hopes, will be to increase the 
pressure on regulatory agencies to curb exposures to compounds 
already on the market. “We're saying that if you care about develop- 
ing a safe chemical, here’s what you should do,” says vom Saal. “Who 
could argue with that?”


Dan Fagin is a science journalism professor at New York University 
and the author of Toms River: A Story of Science and Salvation 
(Bantam Books), to be published in March 2013. 
The anatomy of politics


From genes to hormone levels, biology may 
help to shape political behaviour. 


A popular political advertisement from early this summer begins with US President
Barack Obama addressing a crowd of moon-eyed supporters. Suddenly, the screen goes dark
to a crescendo of minor chords. Phrases such as “Fear
and Loathing”, “Nauseating” and “Divide and Conquer” flash onto
the screen, along with video clips of commentators complaining that 
Obama has used scare tactics to manipulate voters. In the final scene, the 
iconic poster from Obama's 2008 election campaign appears, the word 
HOPE transforming into FEAR as it bursts into flames. 

The advertisement, produced by the conservative organization 
American Crossroads in Washington DC, is typical of those that have 
come to dominate the US airwaves and YouTube in preparation for next 
month's presidential election. Emerging from both the right and the left, 
these commercials increasingly resemble horror films as they seek to 
sway voters by triggering basic emotions such as fear, anger and disgust. 

That strategy fits with emerging scientific evidence about how people 
acquire their political beliefs. In the past, political scientists agreed that 
social forces — most importantly, parents and the childhood environ- 
ment — strongly influenced whether people became conservative or 
liberal, and whether they voted or engaged in politics at all. “We now 
know that it is probably not the whole story,” says John Jost, a psycholo- 
gist at New York University. 

An increasing number of studies suggest that biology can exert a 
significant influence on political beliefs and behaviours. Biological 
factors including genes, hormone levels and neurotransmitter systems 
may partly shape people’s attitudes on political issues such as welfare, 
immigration, same-sex marriage and war. And shrewd politicians 
might be able to take advantage of those biological levers through 
clever advertisements aimed at voters’ primal emotions. 

Many of the studies linking biology to politics remain controversial 
and unreplicated. But the overall body of evidence is growing and might 
alter how people think about their own and others’ political attitudes. 

“People are proud of their political beliefs,” says John Hibbing, a 
political scientist at the University of Nebraska-Lincoln. “We tend to 
think they’re the result of some rational responses to the
world around us.” But in fact, a combination of genes and
early experiences may predispose people to perceive and 
respond to political issues in certain ways. Recognizing that could help 
the public and politicians to develop more respect for those with oppos- 
ing viewpoints. 

“I’d like to see people have a little less chutzpah about their political
beliefs, and understand that some people experience the world differently,”
says Hibbing.


INNATE IDEOLOGY 

The past few decades have seen a wave of research connecting genes 
to disorders such as schizophrenia, depression and alcoholism, and to 
complex outcomes such as sexual orientation and how far people pro- 
gress in education. But until the past decade, this trend largely passed by 
the field of political science. Modern politics seemed too divorced from 
basic human biology, too recent an innovation in human evolutionary 
history, to be influenced by genetic material. 

In 1986, Nicholas Martin and his colleagues published a study suggesting
that genes could exert a pull on attitudes concerning topics such
as abortion, immigration, the death penalty and pacifism. Martin, a 
geneticist now at the Queensland Institute of Medical Research in Bris- 
bane, Australia, used a classic behavioural-genetics technique: compar- 
ing genetically identical twins with fraternal twins of the same sex (who 
share only 50% of genes on average). The identical twins had similar 
political beliefs more often than fraternal twins did. Because twins tend 
to grow up in the same family environments, Martin’s team suggested 
that genes made the difference, and that they have a significant role in 
helping to shape attitudes on social issues. 

Although Martin's study had obvious implications
for political science, researchers in that field
ignored the work. The eugenics movement in […]
about topics that examined genetic differences
among people.

The publication “was like a stone down a 
well”, says Martin. “There was absolutely no
reaction. It just lay there for 20 years.” 

But in the early 2000s, Hibbing and John 
Alford, a political scientist at Rice Univer- 
sity in Houston, Texas, learned about Mar- 
tin’s work. They reanalysed his data and 
incorporated similar data from a study of
attitudes among US twins. In 2005, Hibbing
and Alford published findings nearly identical
to those earlier studies, showing strong correlations between
genetics and political views. These finally caught the attention of 
political scientists. 

It wasn't the kind of attention that Alford and Hibbing were hoping
for, however. “They thought we were crazy,” says Hibbing.

But a few researchers, mainly in the United States, were intrigued 
enough to follow up with further work. James Fowler, a political scien- 
tist at the University of California, San Diego, used the twin method to 
show that voter turnout and political participation also had a genetic
component. Peter Hatemi, a political scientist at Pennsylvania State Uni- 
versity in University Park, found results similar to Alford and Hibbing’s 
using twins from Australia, Denmark, Sweden and the United States, 
although the work has not yet been published. 


TROUBLE WITH TWINS 

The twin studies were far from definitive, in large part because such 
research cannot completely control for environmental factors. Com- 
pared with fraternal twins, genetically identical twins are more likely 
to have the same friends and to maintain regular contact as adults. 
Furthermore, parents, friends and teachers often treat identical twins 
more equally than fraternal twins. All of that makes it hard to unpack 
how much genes and environment each contribute to the shared politi- 
cal attitudes of identical twins. 


A few attempts have been made to tease 
apart the various influences. In one study,
Hatemi found that identical twins who do 
not spend much time together are still more 
concordant than are fraternal twins who do, 
suggesting that genetic factors do matter. 
But the twin studies continue to face strong 
opposition. “I’m very sceptical about estimat- 
ing heritability from twin studies,” says Laura 
Stoker, a political scientist at the University of 
California, Berkeley. “The entire framework is 
built with a tonne of assumptions.” 

Twin studies also offer no insight into how the genome can nudge 
people to lean left or right on various political issues. For that, research- 
ers have started exploring candidate genes. Genes involved with the 
olfactory system and the neurotransmitters glutamate, dopamine and 
serotonin have all been linked to behaviours such as voter turnout and
ideology, although these findings have come under scrutiny and are
yet to be independently replicated. 

Jeremy Freese, a sociologist at Northwestern University in Evanston, 
Illinois, says that such studies have found “implausibly large” effects 
from individual genes. “What's been revealed over the past few years is 
just how vulnerable the candidate-gene approach is,” he says. 

Part of the problem, says Freese, is that studies linking specific genes 
to political behaviours have usually been published in political-science 
journals, rather than scientific journals, so editors and reviewers may 
not have picked up on some deficiencies in the studies. “The reviewers 
have not been familiar with the replication problems that exist,” says 
Freese. 

Christopher Dawes, a political scientist at New York University, 
acknowledges complications in some of his own studies of specific genes 
and says that more illuminating results might come from molecular 
techniques such as genome-wide association studies, which scan the 
genomes of large numbers of people, looking for sequences linked to a 
behaviour or trait. But researchers in this area are only just beginning 
to use such resource-intensive strategies.

“We're starting to better appreciate the limitations of the data and
techniques,” says Dawes.

If other complex behaviours and traits are any indication, the answer 
is not going to be simple. Even for traits known to have a very large 
genetic component, such as height, the evidence points to the influence 
of thousands of genes, each applying a feather-light force. So it seems 
unlikely that a small number of genes can push 
someone towards being a liberal activist, a 
social conservative or a libertarian. 

Many researchers have come to the conclu- 
sion that it is premature to focus on the genetics 
of politics. “It doesn’t make sense to go after the 
most difficult part of the puzzle,” says Alford.

An easier approach is to investigate the path- 
ways that might connect genes with political 
behaviours and attitudes. One connection 
that has been suggested is personality. US 
conservatives may not seem to have much in 
common with Iraqi or Italian conservatives, 
but many political psychologists agree that 
political ideology can be narrowed down to 
one basic personality trait: openness to change. 
Liberals tend to be more accepting of social 
change than conservatives. Some studies suggest
that liberals tolerate more ambiguity and
uncertainty, whereas conservatives are more 
decisive, conscientious and attracted to order. 

Theoretically, a person who is open to 
change might be more likely to favour gay mar- 
riage, immigration and other policies that alter society and are tradition- 
ally linked to liberal politics in the United States; personalities leaning 
towards order and the status quo might support a strong military force 
to protect a country, policies that clamp down on immigration and bans 
on same-sex marriage. But some researchers baulk at such simple links 
between personality and ideology. Evan Charney, a political scientist at 
Duke University in Durham, North Carolina, points out that conserva- 
tives sometimes embrace change, such as proposals in the United States 
to alter the tax code and welfare system. He also says that he and most 
people in his field are liberals — an imbalance that could bias how they 
interpret connections between personality and politics. 


VISCERAL REACTIONS 

Some researchers have sought to move beyond personality studies to 
evaluate how participants’ physiological reactions can influence how 
they interpret and respond to political issues. In 2008, Alford, Hibbing, 
Hatemi and others measured how people reacted to threatening images 
and sudden, loud noises. People who blinked harder and showed
heightened sensitivity — as gauged by their skin conductance — were 
more likely to favour gun rights, capital punishment and the war in Iraq 
than were those who showed less sensitivity. 

In another study, Hibbing showed subjects a series of emotionally
charged images, including a spider on a man’s face, a maggot-infested 
wound, a cute rabbit and a happy child. People who described them- 
selves as conservatives tended to respond more strongly when looking at 
the negative images than at the positive images, whereas liberals reacted 
more to the positive pictures. Conservatives also stared at the negative 
images longer than liberals did, which Hibbing connects to the idea that 
conservatives are more likely to confront fearful or disgusting situations, 
making them more disposed to support a strong military and harsh 
sanctions for criminals. 

Some researchers are exploring whether hormone systems play a part 
in political attitudes. A few studies, for example, have looked at con- 
nections between people's prejudices and their levels of oxytocin — the 
feel-good hormone linked to empathy and bonding with loved ones. In 
one experiment, Dutch participants who had taken puffs of oxytocin
responded more favourably to Dutch people than to foreigners, sug- 
gesting a bias towards their own group. 

Hatemi and Rose McDermott, a political scientist at Brown University 
in Providence, Rhode Island, are currently investigating whether other 
hormones, such as testosterone and cortisol, have any connections with 
ideology. Many of the hormone studies done so far have come under 
attack, because they often rely on small samples and the reported effects 
are sometimes weak. 

Given the research so far, Hibbing remains 
“agnostic” about whether genes or environ- 
mental factors such as parental guidance influ- 
ence political behaviour the most. Either way, 
he says, it is difficult to change someone's mind 
about political issues because their reactions 
are rooted in their physiology. 

“If people spend most of their lives focus- 
ing on negative rather than positive, they're 
probably going to have a different way of 
experiencing the world than those who do the 
opposite,” says Hibbing. So people on the right 
are unlikely to be reached by arguments that 
seem rational to the left, or vice versa. 

But tapping into emotions might prove 
more successful. After the terrorist attacks in 
the United States on 11 September 2001, New 
Yorkers who had been directly exposed to the 
trauma of the events experienced a ‘conservative
shift’, expressing views that were more
patriotic, more supportive of the military and
more religious than they had before. Disgust,
too, can shift attitudes. One study suggested that people exhibit more
conservative views when reminded of impurity — for example, by the 
proximity of a bottle of hand sanitizer, a sign reminding them to clean 
their hands or a foul smell. 

Does that mean that negative political adverts, designed to invoke 
fear and disgust, could actually change people’s views? Alford says that 
these commercials are targeted more at mobilizing favourable voters — 
and demobilizing the opposition. “People who run political campaigns 
know that it’s all about turnout,” says Alford. “It’s not about changing 
hearts and minds; it’s about changing who shows up on election day.” 

Regardless of whether biology shapes political choice, it may affect a 
person’s likelihood of voting in the first place. In an as-yet unpublished 
study, Hibbing has found that people with high levels of the stress hor- 
mone cortisol are much less likely to vote than are demographically 
similar people who have lower cortisol levels. Hibbing is currently 
working to find a way to ease voting for these ‘stressed’ people, perhaps 
through options such as postal voting. 

Alford says that the biggest impact of all this research may be to make 
political discourse more civil and accepting of differences. “It would be 
nice if political science made dinner tables a little more humane,” he says.


Lizzie Buchen is a freelance writer living in San Francisco, California. 
A girl drinks water drawn from a river in Ethiopia, where some 49 million people lack access to safe water.


Wall Street’s thirst for water 


Moves towards a global water commodities market must be stopped. It will push the 
price of food far beyond the peaks of the past five years, warns Frederick Kaufman. 


Early last year, I published an article in
Foreign Policy that explained how Wall
Street profits from hunger. I traced
the history of financial markets in food and
noted how the prices of maize (corn), soya,
rice and wheat had broken records three
times in the past five years. I examined
the impacts of climate change and biofuel 
mandates on the grain futures markets, and 
argued that a global food-pricing system that 
once benefited farmers, bakers and consum- 
ers had been undermined by financial deriv- 
atives created by investment banks. 
These commodity index funds effectively
destroyed the traditional ‘price discovery’
function of the grain futures exchanges in
Chicago, Kansas City and Minneapolis, and
turned these markets into profit engines for
banks and hedge funds while driving up the
price of our daily bread.

Although regulation of global food deriva- 
tives has been promised, years have passed 
and nothing has materialized. In Wash- 
ington DC, abuses of commodity markets 
and other fiddles resulted in 30,000 pages 
of new regulations: the Dodd-Frank Wall 
Street Reform and Consumer Protection 
Act of 2010. Predictably, implementation 
of these laws has been challenged in court
and stalled. Even if the regulations make it 
beyond the Beltway, there will be plenty of 
room for exceptions for the biggest banks. 

Therefore, it is wise to consider what 
global resource will be the next financial 
derivative. What could be more catastrophic 
than betting on the world’s food supply? 
What about our water? 

Speculators can already bet on snow, wind 
and rain through weather-related futures 
contracts bought and sold on the Chicago 
Mercantile Exchange. The market value of 
weather grew by 20% from 2010 to 2011. 
But the sector remains small: a paltry
US$11.8 billion. Still, weather futures indi- 
cate how restless Wall Street has become to 
transform Mother Nature into the mother 
of all casinos. 

Some environmentalists argue that put- 
ting a price on fresh water may be our best 
bet to save the planet’s supply. The more 
it costs, the less we will waste. In fact, the 
financialization of precious resources underlies
The Economics of Ecosystems and Biodiversity
(TEEB), an international initiative
hosted by the United Nations Environment 
Programme and supported by the Euro- 
pean Commission, Germany, the United 
Kingdom, the Netherlands, Norway, Swe- 
den and Japan. TEEB aims to calculate the 
worth of ecosystems down to the last trillion 
dollars, riyals or renminbi. And then there 
is the PES movement — payment for eco- 
system (or environmental) services, which 
refers to such things as the air we breathe and 
the water we drink. Among the many sup- 
porters of this concept are the World Bank 
and the UN Food and Agriculture Organi- 
zation. As TEEB’s 2010 report for business 
puts it: “Modern society’s predominant 
focus on market-delivered components of 
well-being, and our almost total dependence 
on market prices to indicate value, means 
that we generally do not measure or man- 
age economic values exchanged other than 
through markets.” 

Wall Street’s success in cashing in on 
the food bubble, Washington's inability to 
regulate global derivatives and the push to 
commodify nature through TEEB and PES 
converged into a single focus this summer, 
when drought descended on the United 
States. With it came a slew of doom-laden 
social, environmental and economic pre- 
dictions: there will be 3 billion “water- 
stressed” people on Earth by 2035; water 
shortages will become chronic, wildfires 
will be pervasive, monsoons will be even 
more unpredictable; and snow run-offs will 
radically decrease owing to increasingly 
sultry winters. 

At the same time, water is becoming 
essential to a widening variety of industries, 
from hydroelectric power and fracking to 
beer brewing and semiconductor manu- 
facturing. Hydrologists warn that water 
tables are dropping across Asia. Political 
scientists predict squabbles over the own- 
ership and use of Himalayan rivers, and 
every water-well driller in Nebraska knows 
that the Ogallala aquifer under parts of the 
midwestern United States is declining at an 
alarming rate. 

The implications are dire: the destruc- 
tion of aquatic ecosystems, the extinction of 
innumerable species and the risk of regional 
and international conflicts — the much- 
dreaded ‘water wars’ of the twenty-first 
century. What will Egypt do when Ethiopia
dams the Blue Nile? What will happen when
Yemen becomes the first country to run out 
of water? The short answer: nothing good. 


CASHING IN 

Investors of all stripes adore the apocalyptic 
vibe. Within the interstices of violence 
and chaos there will be money to be made. 
These days, the biggest profits do not come 
from buying or selling actual things (such 
as houses or wheat or cars), but from the 
manipulation of ethereal concepts like risk 
and collateralized debt. Wealth flows from 
financial instruments that are one step away 
from reality. 

Investing in a water index is now more 
popular than ever. There are more than 
100 indices that track and measure the value
of stocks of companies in water-related busi- 
nesses, such as utilities, sewage treatment 
and desalination. Several offer healthy 
returns (see go.nature.com/zrl4qo). 

As a result, the World Bank and the Inter- 
national Monetary Fund — both always on 
the lookout for market-based security for 
the billions of dollars of credit they extend 
— have been pushing countries to privat- 
ize their resources. These include the lakes, 
streams and reservoirs of Argentina, Bolivia, 
Ghana, Mexico, Malaysia, Nigeria and the 
Philippines (see, for example, go.nature.com/iuwp8m).
What better guarantee of
prosperity than a rush of multinationals 
determined to generate revenues from 
something no one can manage without? 

So this summer, as cornfields from Ukraine 
to Kansas withered, as bacon shortages made 
headlines and dairymen fed candy to their 
cows, a new message congealed: the world’s 
next great commodity will not be gold or 
grain or oil. It will be water. Useable water.

Although collecting stakes in indices of
publicly traded companies is nice, and
water certainly generates predictable profits,
wouldn't it be more efficient if it could be
translated into a cash equivalent? Perhaps, 
plotted the hedgers and speculators, there 
should be a commodity market in water, 
as there is for gold and grain — a futures 
exchange in which assurances to deliver or 
accept water on some specified future date 
can be traded like cash. 

In certain respects, water is a likely candi- 
date for a futures contract on a commodity 
exchange. First, it meets the requirement of 
fungibility — the commodity pumped from 
one lake or river or stream is pretty much 
the same as that collected from an iceberg, 
extracted from an aquifer, or collected in a 
rain barrel. And water will soon meet the 
second requirement of commoditization: 
it is becoming increasingly liquid, as in 
convertible to cash. Most obviously, water
is global. River-basin management is as hot
a topic for the Volta as it is for the Senegal.
From a money perspective, there is no difference
if the river is Spain’s Guadalquivir, the
French Rhône, the Niger or the Sacramento
in California. 

Financial forecasters perceive that much 
like traditionally traded commodities — 
precious metals, for example — the use- 
able water of the future will be so scarce as 
to need to be mined, processed, packaged 
and, most importantly, moved around the 
world. And they know that the demand will 
not go away. That is the thrust of the think- 
ing behind a global water futures market. 


HIGH STAKES 

The year 1996 marked a Rubicon in the 
history of water and money. The water from 
California's Westlands irrigates an estimated
$1-billion-worth of food a year; at
2,000 square kilometres, it is the largest agri- 
cultural water district in the United States. In 
1996, the district introduced an electronic 
bulletin board for farmers to buy and sell 
their rights to Westlands’ water from their 
home computers. Trading water rights from 
a laptop is now a given, just as commodities 
that once could be bought and sold only on 
the trading floors of Chicago or Kansas City 
are now routinely trafficked by mathematics 
PhDs at hedge funds in Connecticut. If and 
when water becomes an exchange-traded 
derivative, it will join Brent crude, jet fuel, 
and soya bean oil — and be traded any time, 
anywhere, by anyone. 

Making money come out of the tap means 
that fresh water must be given a price any- 
where it is traded — a global price that can 
be arbitraged across the continents. Those 
in Mumbai or midtown Manhattan who 
understand the increasing value of water 
in the world economy will speculate on this 
undervalued ‘asset’, and their investments 
will drive up the cost everywhere. A water 
calamity in China or India — and the food 
inflation, political instability and humani- 
tarian crisis that will surely follow — will 
reverberate in price spikes from London to 
Sydney. This is how bankers will profit. 

Economists have begun to model a 
global water-based futures market fea- 
turing financial puts, calls, shorts, longs, 
exchange-traded funds, indices of indices, 
options piled on top of options, and all sorts 
of opportunity for over-the-counter swaps. 
Flood-insurance companies will certainly 
want to buy stakes that could mitigate their 
financial risk. In fact, every corporation that 
conducts its business in a flood plain, any- 
where, would probably participate. Farmers 
will want to hedge their bets that it will or 
will not rain, as will frackers and fishermen. 
As for the speculators, we know who they 
will be. 


C. ALS/PANOS 


Currently, no one is trading water 
futures, but it won't take much to spark the 
market into life. When the state of Texas 
tallied up to $10 billion in economic losses 
due to the recent drought, academics set 
about theorizing how the water of the Rio 
Grande might be indexed for a futures market. 
After the floods in Thailand last year 
caused economic losses totalling $46 billion, 
Thailand’s Securities and Exchange Com- 
mission studied the possibility of financial 
derivatives indexed to rainfall and dams. 
The semiconductor manufacturer Intel 
might have been interested — the mud and 
muck that reportedly halted its chip-making 
operation in Thailand cost the company up 
to $1 billion. 

A truly global trade in water futures will 
have to wait until the financiers come up with 
a universally adopted measure of water stress. 
Until then, water futures markets will emerge 
as local phenomena based on local concerns. 
For example, in drought-riddled Australia, 
every element of an indexed futures market is 
ready to go on the Sydney Futures Exchange 
(see go.nature.com/u7hdas). In the Medin- 
ipur and Tumkur districts in West Bengal 
and Karnataka in India, where the monsoon 
has become less and less predictable, a south 
Asian water futures exchange has been con- 
ceptualized, to be traded on the Delhi Stock 
Exchange. 

Futures trading will extend from the most 
pristine rivers to the barely legal effluvia of 
solid-waste plants. Commodity theorists 
in Switzerland have taken the first steps in 
setting up markets that will trade in water 
futures derived from sewage. The team calls 
its concept an ethical water futures market 
(see go.nature.com/dq6fm4). In my view, it 
is a financial platform to sell treated water to 
the highest bidder. 

A woman in Jamam refugee camp in South Sudan holds up a container of water from a makeshift well. 

In all of these cases, the futures contracts 
will emerge from valuations of relative water 
scarcity or plenitude based on an index of 
water levels in dams, average precipitation or 
other indicators and predictors. Ultimately, 
the financial instrument will have the same 
basic structure as the index funds which 
brought unprecedented levels of specula- 
tion to the global grain market, increasing 
the volatility in prices — volatility that 
futures exchanges were originally meant to 
blunt. After all, if the natural-gas industry 
can pay more for water than soya farmers, 
then the gas drillers will get the water and 
the soya will not. 


PRICED OUT 
The reverberations of a global water futures 
market can hardly be imagined. This much 
is clear: a water betting game will leave crops 
thirsting and push the global price of food 
far beyond the peaks of the past five years. 
The good news is that, unlike the failed 
attempts to regulate the derivatives markets 
in food, something can still be done in the 
case of water. There are plenty of examples 
of valuing water outside the realm of pure 
commodification. One of the best examples 
has been developed in the Ruhr basin in 
Germany. This riverine resource is managed 
not by the invisible hand of the market, 
but by a policy-creating body called the Ruhr 
Association. Cities, counties, industries and 
enterprises in the region are represented by 
associates and delegates. A total of 543 stake- 
holders negotiate water-abstraction fees and 
pollution charges. The politics may be messy, 
but it works. Unfortunately, that is the way 
with democracy. 

There is no easy panacea for the world’s 
water needs, least of all the global derivatives 
business, which has proved that it is not to 
be trusted with mortgage-backed securi- 
ties, much less our most precious resource. 
There is no need to initiate a futures market 
in water only to create yet more financial 
madness that seems to resist all attempts at 
regulation. This time around, let the business 
stop before it starts. 

Symmetries feature in the stained-glass ceiling of the Palace of Catalan Music in Barcelona, Spain. 


Why symmetry matters 


Mario Livio celebrates the guiding light for modern physics. 


Symmetries lie at the heart of the laws 
of nature. Early scientific giants such 
as Galileo Galilei, René Descartes 
and Isaac Newton did not speak in those 
terms, but symmetries were implicit in their 
ideas of a comprehensive framework of the 
Universe. And symmetries lie explicitly at 
the basis of modern physics, from general 
relativity to quantum field theory. 

To a physicist, symmetry is a broader 
concept than the reflective form of butterfly 
wings, or the rotational similarity of a 
triangular roundabout sign. In physics, to be 
symmetrical is to be immune to possible 
changes. Symmetry represents those stub- 
born cores that remain unaltered even under 
transformations that could change them. 

Sometimes symmetries break down. The 
announcement on 4 July that two teams at 
the Large Hadron Collider (LHC) at CERN, 
Europe’s particle-physics laboratory near 
Geneva, Switzerland, had discovered a 
Higgs-like boson marked the end of a heroic 
quest to confirm a ‘symmetry-breaking’ 
mechanism. That such a process endows 
particles with mass was suggested in 1964 
by theoretical physicist Peter Higgs and two 
other groups (Robert Brout and François 
Englert; and Gerald Guralnik, Carl Hagen 
and Tom Kibble). 


If the new boson is confirmed as the 
long-sought Higgs, our faith in symmetry as 
a core concept in physics will be cemented. 
Further tests of symmetry’s power as a guide 
to physical laws will follow, in the quantum 
realm and through concepts such as super- 
symmetry. But what is symmetry and how 
is it broken? 


THE LAWS OF NATURE 

Universal laws are symmetric under 
translation in space and time — they don’t 
change from place to place or from today 
to tomorrow. As astronomers know, hydro- 
gen atoms billions of light years away are 
subject to the same physics as on Earth. 
Because physics has no preferred direction, 
natural laws are also symmetric under rota- 
tion. Whether we measure orientation with 
respect to the farthest quasar or the nearest 
coffee shop, the rules are the same. 

Such symmetries are true of Newton's laws 
of gravity and motion, which he applied in 
the seventeenth century to falling apples, 
ocean tides, the Moon and planetary orbits. 
They are also true of those laws’ twentieth- 
century successors: Albert Einstein's special 
and general theories of relativity, with their 
revolutionary ideas about the constancy 
of the speed of light, the equivalence of 
acceleration and gravity, and the warping of 
space-time by masses. 

Imposing symmetry on all frames 
of reference was Einstein’s motivation for 
developing relativity theory. He wanted to 
find a way of describing the laws of nature 
that would look the same to all observers, 
whether at rest, moving at a constant velocity 
or accelerating. 

In looking for symmetries, Einstein was 
following in the footsteps of James Clerk 
Maxwell, a physicist who died the same 
year Einstein was born, in 1879. Purely for 
mathematical balance, Maxwell added to 
equations describing electromagnetism an 
extra term that related electrical currents to 
a resulting magnetic field. In doing so, he 
unified all electric and magnetic phenom- 
ena then known, as well as the laws of optics. 


INTERNAL SYMMETRIES 

The symmetries that give rise to the Higgs 
boson are different. They originate in the 
probabilistic world of quantum physics — 
‘internal symmetries’ reflect the robust- 
ness of universal laws to changes in the 
identity of some elementary particles. Just 
as a particle’s position may be uncertain 
according to quantum rules, so may its 
identity. Some types of particles are thus 
interchangeable within the equations. 

Electrons and neutrinos can be swapped 
without altering the laws of nature. This 
symmetry is central to the ‘electroweak’ 
theory, developed in the 1960s by physicists 
Steven Weinberg, Sheldon Glashow and 
Abdus Salam. The theory unifies electro- 
magnetism with the weak nuclear force, 
which is responsible for some radioactive 
decays (such as a neutron to a proton) and 
underlies nuclear reactions in stars. The 
Higgs boson has a role in breaking the 
symmetry of weak interactions. 

The strong nuclear force, which acts 
among quarks to make up protons and 
neutrons, and binds protons and neutrons 
to form atomic nuclei, is also subject to 
symmetries. The laws of physics are blind, 
for instance, to exchanges of up quarks and 
down quarks (protons are made of two up 
quarks and one down quark). 

The standard model of particle physics 
combines the electroweak theory with 
quantum chromodynamics, the theory of 
strong interactions within the nucleus. The 
model's great achievement is that it unifies 
three of the four known forces through 
which particles interact (gravity stubbornly 
continues to resist unification). 

Internal symmetries are local, or 
independent of the locations of particles. 
Quarks (or electrons and neutrinos) at dif- 
ferent points in space and time may be 
exchanged without consequence. Fields, with 
their associated gauge bosons, mediate these 
remote interactions. The electromagnetic 
interaction is carried by the photon. In elec- 
troweak theory, the photon is joined by the 
three carriers of the weak nuclear force: W⁺, 
W⁻ and Z⁰ particles. Gluons shepherd the 
strong force between quarks. 

The symmetries associated with a given 
fundamental force act on all associated 
elementary particles, not just the force 
mediators. The same electroweak symmetry 
that treats the W⁺, W⁻ and Z⁰ particles as 
equivalent, for instance, applies to the 
electron and neutrino. The symmetry 
requirement dictates which particles and 
interactions are necessary for a given theory. 

Yet the cosmos does not always manifest 
perfect symmetry. The equations describing 
the electroweak interaction, for example, are 
symmetrical. They do not change when a 
photon is swapped with a W or Z particle. 
But the solutions to these equations — the 
particles themselves — are not identical. The 
photon is massless, and the W and Z particles 
are about 100 times heavier than a proton. 
The symmetry of the governing equations is 
somehow lost, or broken, in physical reality. 


BROKEN SYMMETRIES 

The concept of spontaneous symmetry- 
breaking allows physicists to preserve 
a symmetric theory while confronting 
puzzling observations. 

In the 1930s, the Jewish–Russian physicist 
Lev Landau realized that phase transitions 
are accompanied by a loss of symmetry. 
Take magnetic materials. When hot iron is 
cooled below its Curie temperature (around 
770°C), it acquires bulk magnetism. Its 
internal magnetic fields, pointing in random 
directions when hot, collectively settle on 
one orientation. The symmetry is broken. 
The equations describing the magnetic field 
within the iron are symmetric — they have 
no preferred orientation. It is the physical 
state of the iron that changes. 

Higgs and his colleagues realized in 1964 
that symmetry-breaking could be applied to 
particle physics. They proposed that a frac- 
tion of a second after the Big Bang, as the 
Universe expanded and cooled, it went 
through a dramatic phase transition (see 
‘Fundamental forces’). The internal symmetry 
of the weak interactions, which held true at 
very high energies, broke when the Universe’s 
energy dropped below some threshold. 


Figure ‘Fundamental forces’: After the Big Bang, the four forces divided as the cooling Universe underwent phase transitions. 
The Higgs boson broke the symmetry between the electromagnetic and weak nuclear forces. 

The mechanism by which it did so (the 
Higgs mechanism) involves a quantum field 
(the Higgs field), which has a non-zero value 
associated with every point in space. The 
Higgs particle is a ripple, a parcel of energy, 
in the Higgs field. 

The Higgs field tugs on W and Z particles, 
restricting their communication of the weak 
force to an extremely short range (less than 
about one-ten-thousand-trillionth of a 
centimetre). In other words, it gives the W 
and Z particles inertia, or mass. In similar 
fashion, the molasses-like Higgs field gives 
mass to other fundamental particles, such as 
electrons and quarks. 

Because the vacuum does not carry 
electrical charge, the photon travels 
unhindered. So the photon remains mass- 
less and can render the electromagnetic force 
over long distances. 

So far, the particle discovered in July at 
the LHC looks a lot like the Higgs boson. 
More tests are needed to prove it. First, the 
experimentalists must determine the 
quantum spin of the new boson (the Higgs 
is predicted to have no spin). Second, they 
need to measure the rates at which it decays 
into other particles and compare those to 
theoretical expectations. Even if the boson 
passes these tests, symmetry and its breaking 
do not leave centre stage. 

One of the major steps beyond the 
standard model involves supersymmetry 
— the idea that each particle we know has a 
not-yet-discovered superpartner, with a spin 
removed by half a quantum-mechanical unit. 
Supersymmetry is manifestly broken; other- 
wise the superpartners would have had the 
same masses and charges as the known par- 
ticles and would have been detected already. 
A broken supersymmetry opens the door to a 
host of other potential bizarre processes, such 
as an electron transforming into a muon. 

There are no signs as yet from the LHC 
of supersymmetric particles, but this could 
change. Although the simplest versions of 
supersymmetry seem to have been ruled out, 
no one knows what to expect when the LHC 
increases its energy in two years. 

Of course, the ultimate goal remains an 
all-embracing theory that will unify gravity 
with the other interactions. We still do not 
know if the underlying principle of such a 
theory is symmetry, but a confirmation of 
the newfound boson as the Higgs will show, 
once again, that symmetry is a guiding light 
through nature’s labyrinth. 

AUTUMN BOOKS 


ENVIRONMENTAL SCIENCES 


Divided we save 


An expanded model for heading off planetary tipping points offers much to both 
optimists and the apocalyptically inclined, finds Peter Kareiva. 


Pessimism usually seems smarter than 
optimism. The pessimist is the ant in 
Aesop’s fable: the realistic, sage, cau- 
tionary voice. The optimist is the grasshop- 
per: the naive, un-analytical fool. Thomas 
Malthus was perhaps the most famous pes- 
simistic intellectual, writing in his 1798 An 
Essay on the Principle of Population: “The 
power of population is so superior to the 
power of the earth to produce subsistence 
for man, that premature death must in some 
shape or other visit the human race.” 
The latest contribution to this grand 
apocalyptic tradition is Bankrupting Nature. 
But, unlike previous pessimistic assessments 
of tipping points for humanity and Earth, 
this book by Johan Rockström, head of the 
Stockholm Resilience Centre, and Anders 
Wijkman, a former member of the European 
Parliament, combines a critique of political 
and economic systems with an assessment 
of global ecology. It is an extension of the 
influential planetary-boundaries idea that 
Rockström and colleagues first published in 
Nature in 2009. 

Many scientists and scholars have issued 
warnings about impending famine and 
environmental decimation, only to witness a 
relentless improvement in the human condi- 
tion; but that is no reason to dismiss this book. 
First, the framing of limits in the past has 
primed important discussions and motivated 
change. Second, the ten planetary boundaries 
that Wijkman and Rockström have set out 
are not grounded in human thresholds, but 
are based on ecological, biogeochemical and 
atmospheric processes such as ocean acidifi- 
cation, biodiversity loss and freshwater use. 
Most importantly, no one can dispute that 
human impacts on the planet are greater than 
ever, and still increasing, as the population 
grows and developing nations aspire to North 
American levels of consumption. 

ILLUSTRATIONS BY SPENCER WILSON 

And the stakes are high. If one mistakenly 
warns of a doom that is not really around the 
corner, the cost might be slower economic 
growth. But if one fails to see the true apoc- 
alypse on the horizon, the consequences 
could be calamitous. For this reason, 
Bankrupting Nature deserves our attention. 

The book's arguments are familiar, but 
rarely have they been gathered together in 
one place to clarify the links between poli- 
tics, economics and ecology. Wijkman and 
Rockström lay out the dangers inherent in 
ignoring the planet’s boundaries, and focus 
on renewable resources as an energy solution. 
They make a clarion call for transforming 
our economic system, replacing the out- 
dated economic indicator of gross domestic 
product with broader human-welfare met- 
rics such as Chilean economist Manfred 
Max-Neef’s model and its incorporation of 
human needs, from freedom to creativity. 

There are strengths and weaknesses in 
each of these discussions. Wind and solar 
power, the featured renewables, are really 
only an electricity solution. That leaves out 
many of our energy demands. And there is 
still a need to be as critical of problems with 
the transmission, storage and land footprints 
of renewable alternatives as we have learned 
to be with fossil fuels. 

Regarding a new economics, the book 
spins several nice stories, but is light on 
analytical policy options. Take the authors’ 
championing of a ‘circular’ economy, based 
on service, virtual assets and re-use as 
opposed to ever-expanding consumption 
of raw materials or a no-growth economy. 
What about the server farms that store and 
deliver all our information? These are energy 
hogs full of metals such as copper and silver, 
which must be extracted through often- 
damaging mining processes. Even recycling 
is not always environmentally benign. 

Unlike many ‘limits to growth’ proclamations, 
however, Wijkman and Rockström’s book makes 
the useful distinction between global and local 
perspectives. So although the nitrogen cycle is 
globally disrupted by excessive fertilization, 
additional fertilizer is still a good idea for 
depleted soils in Africa. Recovery, restoration 
and resilience might have been included, as 
in many cases, restoration remains possible 
even where boundaries have been locally 
exceeded and ecosystems have collapsed. 

It has to be said that the boundary fram- 
ing does not lend itself to policy actions. For 
example, we need, however roughly, to 
estimate the likelihood of people suffer- 
ing severe impacts from climate disrup- 
tion, and then to analyse the costs and 
benefits of different actions that could influ- 
ence the carbon dioxide emissions trajectory. 
It is the relative effectiveness of natural gas at 
replacing coal that is the central question — 
not the planetary boundary for CO₂. 

The authors sensibly sidestep the bound- 
ary model in discussing population growth, 
which they emphasize is the ultimate driver 
of many environmental problems. Instead, 
they focus on planetary opportunities: proven 
ways of reducing population, such as modest 
investments in family planning, economic 
opportunities for women and education of 
young girls. Similarly, in the developed world, 
we can modify consumption using modern 
marketing, education and pricing incentives. 
Approaching CO₂, acidification, nitrogen and 
the many other planetary boundaries in the 
same way would have been useful. 

However the approaches are shaped, Wijkman 
and Rockström remind us that action is 
imperative to stave off crisis. But a crisis of 
what magnitude? They warn of a coming 
apocalypse triggered by climate disruption 
and resource scarcity, drawing on Gwynne 
Dyer’s book Climate Wars (Random House, 
2008), with its scenarios of civil wars, conflicts 
between nations and collapsed governments. 
Earth does face severe environmental chal- 
lenges, but these are a far cry from impending 
collapse. And although Rockström points to 
the risk of science being drawn into the politi- 
cal process, modern environmental science 
is all about examining trade-offs between 
choices in energy, resource use, infrastructure 
and waste disposal. 

The ideas in this book are interesting and 
often inspiring, and, to some extent, the 
future of sustainability depends on wrestling 
them to the ground. In doing so, it is key to 
recognize that some scientists are optimists 
and others pessimists. To progress, this 
debate about limits and tipping points needs 
to be recast as a series of testable hypotheses 
about resource extraction, land and water 
use, and energy portfolios, addressable by 
models and data. 

Some say that innovation, technology and 
new partnerships between business and con- 
servation hold the answers to a bright future. 
Others argue that slower growth, happiness 
indicators and strong regulatory constraints 
are the only way to avert the apocalypse. It is 
premature to predict who will win the debate: 
the grasshopper or the ant. 


Peter Kareiva is chief scientist at The 
Nature Conservancy in Arlington, Virginia. 
e-mail: pkareiva@tnc.org 


© 2012 Macmillan Publishers Limited. All rights reserved 


BOOKS & ARTS | COMMENT | 


OCTOBER 2012 | VOL 490 | NATURE | 475 


MATHEMATICS 


A fractal life 


Mark Buchanan enjoys the quirky memoir of a 
mathematical rebel — the late Benoit Mandelbrot. 


At the start of The Fractalist is a photograph 
dated June 1930. In a Jewish 
family’s apartment in Warsaw, four 
Polish mathematicians are hosting a meal 
for an honoured guest — French mathema- 
tician Jacques Hadamard. Somewhere in the 
same house, although not pictured, would 
have been a six-year-old named Benoit. The 
family’s surname was Mandelbrot. 

A hugely productive theorist of geometry 
and roughness, and the “father of fractals”, 
Benoit Mandelbrot died aged 85 on 14 Octo- 
ber 2010. The Fractalist is his posthumously 
published autobiography, polished with help 
from his former assistant and colleagues. 

A collection of poignant vignettes, the 
memoir paints Mandelbrot as an intellectual 
individualist who created his own field. He 
was reputedly a prickly character, self-important 
and quick to take offence. Traces of his 
strong opinions come through, but they are 
overwhelmed by warm and delightful reminiscences 
of family and colleagues. 

NEW IN PAPERBACK 
Highlights from this season’s releases 

Mandelbrot explains that in that 1930 
photo are several of the most important peo- 
ple in his life. Key among them is his mathe- 
matician uncle, Szolem. The first in the family 
to attend university, Szolem Mandelbrot had 
travelled to Paris to study mathematics under 
Hadamard, and went on to hold the same 
chair at the Collège de France as Henri Poincaré. 
He was Mandelbrot’s informal mentor, 
and linked the family to international science. 
In 1936, as European civilization began to 
dissolve, Mandelbrot’s “lucid and decisive” 
parents took the family to Paris to join Szolem. 
Benoit recalls learning about the history and 
architecture of the city during long walks. 
When the war started, the family fled occupied 
Paris for rural Tulle in the unoccupied 
Limousin region. 

Danger was never far away, and in 1943, 
the family went into hiding and the parents 
and children split up. Posing as apprentice 
tool-makers, Benoit and his cousin Leon 
aroused suspicion and were arrested by the 
police, who were seeking the culprits of a 
bombing. Days of anguish followed before 
a helpful official intervened on their behalf. 

The Fractalist: Memoir of a Scientific Maverick 
BENOIT MANDELBROT 
Pantheon: 2012. 352 pp. $30 

Through such kindnesses, and the deter- 
mination of his parents, Mandelbrot survived 
the war. Back in Paris, his mathematical skill 
gained him entrance to the prestigious École 
Polytechnique. This would have guaranteed 
him a comfortable career in France, but Man- 
delbrot never settled for the safe option. 

In one vignette, Mandelbrot recalls a “life- 
altering verbal lashing from Szolem”. It was 
1952 and Mandelbrot, aged 28, was back 
in Paris after studying fluid dynamics and 
aircraft design at the California Institute of 
Technology in Pasadena. He hadn't chosen 
a topic for his PhD, and Szolem berated him 
for excessive book learning and preparation 
to be a “well-trained monkey”. 

Szolem suggested pursuing quadratic 
dynamics, the sometimes complex behav- 
iour of mathematical functions when iter- 
ated — plug in a number, take the result and 
plug that back in again, endlessly. Mandelbrot 
looked into the topic, then dropped it. But the 
lashing had an effect. Soon, Szolem pointed 
Mandelbrot to the linguist George Zipf, who 
had found a scaling law in the relative fre- 
quency of word use that seemed to be univer- 
sal across languages. “Silly stuff only you can 
like” was Szolem’s view. Benoit, to the alarm 
of his professors, chose it as his thesis topic — 
and so set a course to revolutionize science. 

Mandelbrot’s insight that the apparently 
random might harbour hidden order broadened 
over his postdoc positions at the Massachusetts 
Institute of Technology (MIT) in 
Cambridge; Princeton University in New 
Jersey; and the University of Geneva in 
Switzerland. In 1958, he began a 35-year 
career at the IBM Research Center in Yorktown 
Heights, New York. 

The Red Market: On the Trail of the World’s Organ Brokers, Bone Thieves, 
Blood Farmers, and Child Traffickers 
Scott M. Carney (William Morrow, 2012; $14.99) 
From illegal adoptions to a bag of tibias used to make flutes on the Indo- 
Bhutanese border, Scott Carney explores the illegal trade in human bodies 
and body parts. His investigation into this shadowy realm exposes a system 
that exploits donors, benefits middlemen and puts a price on our very 
existence. (See Laura Spinney’s review: Nature 474, 156-158; 2011.) 

His work on market fluctuations in the 
1960s could have revolutionized finan- 
cial economics if the field’s equilibrium- 
focused ideology hadn't pushed it aside for 
30 years. Perceiving parallels in everything 
from fractured surfaces to stock-market 
movements, Mandelbrot coined the term 
fractals and did more than anyone to make 
it possible to talk about natural roughness 
and disorder in a precise, scientific way. 

In 1979, he returned to Szolem’s sug- 
gestion of quadratic dynamics. Within a 
year, his explorations of functional itera- 
tions led him to discover the Mandelbrot 
set — an astonishing pattern of infinite 
richness produced by simple rules. This 
launched a whole branch of mathematics. 
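
The 'plug in a number, take the result and plug that back in again' recipe is easy to try. The review does not give the formula, so this minimal Python sketch assumes the standard textbook rule z → z² + c, starting from z = 0, with c counted as belonging to the Mandelbrot set if the iterates never escape. The function names, the iteration cap and the escape radius of 2 are illustrative choices, not taken from the book.

```python
def escape_time(c: complex, max_iter: int = 100, bound: float = 2.0) -> int:
    """Iterate z -> z*z + c from z = 0.

    Returns the number of steps taken before |z| first exceeds `bound`,
    or `max_iter` if that never happens within the iteration cap.
    """
    z = 0j
    for n in range(max_iter):
        if abs(z) > bound:
            return n
        z = z * z + c  # feed the result back in
    return max_iter


def in_mandelbrot_set(c: complex, max_iter: int = 100) -> bool:
    """Approximate membership test: c is kept if its orbit stays bounded."""
    return escape_time(c, max_iter) == max_iter


if __name__ == "__main__":
    # Coarse text rendering: '#' marks points whose orbit stays bounded.
    for row in range(11):
        y = 1.0 - 0.2 * row
        print("".join(
            "#" if in_mandelbrot_set(complex(-2.0 + 0.05 * col, y)) else "."
            for col in range(61)
        ))
```

Because only a finite number of iterations is checked, the test is an approximation: points very close to the boundary may be misclassified unless max_iter is raised.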

Much of what makes The Fractalist fun 
to read is Mandelbrot’s scattered recol- 
lection of encounters with luminaries. 
At MIT he discussed linguistics with the 
young Noam Chomsky and argued with 
anthropologist Margaret Mead. Robert 
Oppenheimer liked his ideas, and Man- 
delbrot worked with the famed psycholo- 
gist Jean Piaget in Geneva and computing 
pioneer John von Neumann at Princeton. 

Mandelbrot recalls how after one of 
his lectures, a famous mathematician 
objected, saying he had made “absolutely 
no sense at all”. Oppenheimer and von 
Neumann sprang into action, explain- 
ing points that even Mandelbrot hadn't 
noticed. The meeting, he remembers, 
“went from abysmally low to unforgettably 
high”. His appreciation of friends, music 
and quirky mathematics colours every 
page of The Fractalist. Mandelbrot’s odd 
habits explain why he was so original: he 
avoided work involving direct competition 
with others, and naturally worked in the 
gaps between fields, in blind spots. 

“A youthful decision set me on a mav- 
erick’s lonely ride,” he writes. “Its conse- 
quences took a long time to develop.” 


Mark Buchanan is a physicist and writer 
based in France. His latest book is The 
Social Atom. 

e-mail: buchanan.mark@gmail.com 


MEDICAL HISTORY 


Stemming the red tide 


Lab luminaries jostle with consumptive cultural icons in a 
vivid history of tuberculosis, finds Stefan H. E. Kaufmann. 


Tuberculosis has killed more people 
than all wars combined, and was 
a leading cause of death in many 
Western megacities as recently as the 1950s. 
Historically, tuberculosis (TB) was hothoused 
in overcrowded European and US cities: at the 
start of the twentieth century in New York and 
Berlin, it killed 40% of people aged 25-40. Yet 
TB was in decline until the late twentieth cen- 
tury, when the HIV/AIDS epidemic in sub- 
Saharan Africa triggered a resurgence. Now, 
it is the number one cause of death among 
people with HIV, and the incidence of multi- 
drug-resistant strains is rising. 

But the story goes beyond the medical. TB’s 
long history, wide spread and lethality irrevo- 
cably link it to social and cultural history, as 
Helen Bynum reveals in her gripping history 
of the disease and its impacts, Spitting Blood. 

Bynum kicks off with the case of George 
Orwell, which encapsulates the progression 
of TB and its impact on culture. In 1949, when 
Orwell published Nineteen Eighty-Four, he 
was in the late stages of the disease, which 
arguably influenced his novel’s dystopian 
tone; he died a year later. Bynum goes on to 
expertly turn the many facets of TB to the 
light, from biology to medicine and socio- 
economics. She ends with a brief account of 
why it has not been eradicated. 

TB is caused by Mycobacterium tuber- 
culosis, a bacterium that probably evolved 
from an environmental microbe to a human 
pathogen. The signs of ‘consumption’ or 
phthisis — pulmonary TB, triggered by aero- 
sol infection of the lung — are coughing and 
spitting of blood. Before the nineteenth cen- 
tury, another common form was scrofulosis, 
in which the lymph nodes became pus-filled 
and ulcerated. It usually arises from ingest- 
ing bacteria, so the increasing eradication of 
bovine TB and the sterilization of milk have 
radically reduced its incidence. Pott's disease, 
a third, rare form, affects the bones. 


Phthisis and scrofulosis were known in 
antiquity. But it was not until the late Renais- 
sance that anatomists came to recognize TB 
lesions, and spotted similarities between the 
manifestations of the disease. 

TB began to gain cultural cachet as art- 
ists succumbed. The death at 25 of English 
Romantic poet John Keats (a trained surgeon) 
did much to glamorize the disease. In Rome 
in 1820, Keats “vomited near two cupfuls 
of blood”; he died a few months later. Anne 
Brontë and, possibly, her sister Emily
succumbed at 29 and 30. Consumption became
linked to the punishment and redemption of 
a bohemian life, as with the courtesan Violetta 
in Giuseppe Verdi's 1853 opera La Traviata. 

Meanwhile, as Bynum shows, the medical 
disease began to emerge, and “consumption 
became tuberculosis”. Louis Pasteur provided 
the first evidence that microbes caused cer- 
tain diseases. Robert Koch showed that TB 
was infectious, and demonstrated that a 
single pathogen was responsible for its dif- 
ferent forms. At the time, clinicians tried to 
treat the disease using the ‘pneumothorax’ 
method, collapsing the lung by inserting nee- 
dles into the pleural cavity, in the hope that 
pressure on the lung would lead to a cure. But
the unveiling of TB as a bacteriological
disease paved the way for effective drugs,
diagnostics and a vaccine.

Spitting Blood: The History of
Tuberculosis
HELEN BYNUM
Oxford Univ. Press: 2012. 352 pp. £16.99,
$34.95

In the second half of the nineteenth
century, efforts to control public spitting
began. Dispensaries were set up, then
sanatoriums for the rich, the first in
Germany in the 1860s. Working-class
sanatoriums emerged courtesy >


Knocking on Heaven’s Door 
Lisa Randall (Ecco, 2012; $16.99) 
What can CERN’s Large Hadron Collider (LHC)
reveal about the make-up of the Universe? Particle
physicist Lisa Randall explains, alternating details
of the LHC’s inner workings with more general
musings on the philosophy of science. (See
Joseph Silk’s review: Nature 477, 30-31; 2011.)
Survivors: The Animals and Plants that Time
Has Left Behind 

Richard Fortey (Harper, 2012; £9.99) 
Palaeontologist Richard Fortey narrates the 
history of life by looking not to the long-extinct, 
but to organisms that have survived, almost 
unchanged, for millions of years. These survivors, 
he says, speak to us of pivotal evolutionary events. 


25 OCTOBER 2012 | VOL 490 | NATURE | 477 


| COMMENT | BOOKS & ARTS 


> of national-insurance programmes and
philanthropy. People with joint and bone 
TB, often children, underwent orthopaedic 
treatments, such as wearing shells and 
jackets that left just face and ears exposed. 
In 1921, as Bynum reveals, Albert Cal- 
mette and Camille Guérin devised the 
first breakthrough: the vaccine bacille 
Calmette-Guérin (BCG). Twenty years 
later, Albert Schatz and Selman Waksman 
discovered the second: the TB drug strep- 
tomycin. Intensive research spawned 
others, and the 1950s saw the first broad- 
scale fight against TB, with mass X-ray 
screening and drug therapy. But although 
national TB-control programmes sprang 
up, interrupted treatments and poor 
patient compliance led to drug resistance, 
notably in regions of conflict and asylum 
camps. From the beginning, resistance 
against single drugs was noted, and clini- 
cal trials spearheaded by the UK Medical 
Research Council revealed that multi- 
drug therapy was essential for preventing 
the development of drug-resistant strains. 
A rigorous multi-drug treatment 
programme led by the World Health 
Organization (WHO) effectively defeated 
non-resistant TB, but about 50 million peo- 
ple currently harbour multi-drug-resistant 
tubercle bacilli. Of these, nearly half a mil- 
lion develop the disease each year. With 
around 15 million people co-infected with 
HIV and tubercle bacilli, and nearly 1 mil- 
lion active TB cases a year among people 
with HIV, we are seeing a perfect storm. If 
Bynum’s book has a weakness, it is that it
lacks an outlook on how the WHO's goal 
to eliminate TB by 2050 could be achieved. 
Spitting Blood provides impressive 
insight into TB as a medical and social 
disease. Meanwhile, photographer James 
Nachtwey’s gallery of people with drug- 
resistant TB (www.xdrtb.org) comple- 
ments this work on a disease that is still 
very much with us. In 2009, London alone 
harboured nearly 3,500 TB cases: an almost 
50% increase over the preceding decade.


Stefan H. E. Kaufmann is director of the 
department of immunology at the Max 
Planck Institute for Infection Biology in 
Berlin. 

e-mail: kaufmann@mpiib-berlin.mpg.de 


A More Perfect Heaven: How Copernicus 
Revolutionised the Cosmos 

Dava Sobel (Bloomsbury, 2012; £8.99) 

Mixing drama with history, Dava Sobel offers a 
biography of Copernicus with a twist, working
in a two-act play in which a student convinces 
him to publish his revolutionary work. (See Owen 
Gingerich’s review: Nature 477, 276-277; 2011.) 
Margaret Sanger: A Life of Passion

Jean H. Baker (Hill and Wang, 2012; $17.00) 
Margaret Sanger emerges as a daring and 
determined character in historian Jean Baker’s 
vivid life. Sanger’s fervent belief that women 
should be able to limit the size of their families led 
to the development of the contraceptive pill. (See 
W. F. Bynum’s review: Nature 478, 318; 2011.) 
The search for Earth’s twin


Sara Seager enjoys a frank and vivid account of planet hunting. 


People have pondered the existence
of Earth-like worlds beyond our
Solar System for millennia. In 1995,
the first exoplanet was spotted, orbiting a 
Sun-like star. Fewer than two decades later, 
hundreds of planets and thousands of can- 
didates are known. Yet the sheer variety of 
exoplanet masses, sizes and orbits indicates 
that close copies of the Solar System are rare. 

In Mirror Earth, science writer Michael 
Lemonick relates astronomers’ abiding inter- 
est in sister worlds, from ancient Greece — 
where Aristotle surmised that there is only 
one world, Earth — to today, and beyond. 
Spotting a small, light and faint planet near 
its massive, bright host star is difficult and 
requires abstruse techniques. Lemonick sets 
out these methods — radial velocity, transits, 
microlensing, direct imaging, astrometry 
and pulsar timing — with impressive clarity. 

NASA’s Kepler space telescope is rightly a
focus of the book. The Kepler mission’s goal 
is to establish how common Earth-size plan- 
ets in Earth-like orbits about Sun-like stars 
are. But, as Lemonick is careful to spell out, 
these will not necessarily be Earth ‘twins’ — 
planets with water oceans, continents and 
thin atmospheres conducive to life. 

Knowing a planet's size doesn't tell us 
whether it is habitable. Venus, for example, 
is about the same size as Earth but not at all 
Earth-like. A thick carbon dioxide atmos- 
phere causes greenhouse heating of the sur- 
face to temperatures exceeding 700 kelvin, 
hot enough to melt lead. 

The Kepler telescope, launched in March 
2009 and trailing Earth as the planet orbits 
the Sun, is making astonishing discover- 
ies. One is that ‘mini-Neptunes’ — plan- 
ets a few times Earth’s diameter and a little 
smaller than Neptune — are many times 
more common than larger Jupiters, planets 
about 11 times the size of Earth. There are 
hints that smaller planets may be even more 
numerous. This tantalizing finding could
imply that rocky planets (thought to reach
no more than 1.75 times Earth’s size) out- 
number their gas-rich cousins. 

Kepler has also vastly increased our knowl- 
edge of the variety of planetary systems. These 
include single planets orbiting two suns and 
dozens of multiple-planet systems, some in 
surprisingly compact orbits. The book covers 
the zoo of planetary types thought to exist, as 
well as theories of planet formation, condi- 
tions for habitability and the origin of life. 

Lemonick describes Kepler’s rocky road 
from conception to launch, including how 
its technical feasibility was repeatedly chal- 
lenged, leading to the proposal being rejected 
by NASA several times. And he discusses the 
implication of the recent realization that most 
Sun-like stars are more variable than our Sun. 
That means Kepler must scour them for twice 
as long to pick out the signature dimming of 
passing planets, doubling the time it will take
to reach the census goal. 

Finding exoplanets is an art as well as a 
science, and Lemonick gives examples to 
demonstrate how hard it can be to tease a 
weak signal out of data. “It’s my Rembrandt,”
declared exoplanet hunter Paul Butler of his 
computer code — six years in the making 
— that cracked the puzzle of how to meas- 
ure the precise motion of a star caused by 
an orbiting planet. Controversy invariably 
follows about which detections are robust,
and Lemonick captures some of those
lively exchanges with quotes and stories.

Lemonick provides a 
brutally honest charac- 
ter exposition of well- 
known exoplaneteers. 
He details Geoff Mar- 
cy’s feelings of being, 
in Marcy’s words, “an 
impostor” early in 
his career owing to 
a severe lack of con- 
fidence that plagues 
many young scientists, and Debra Fischer's 
unusual route into astrophysics via nursing. 

One intriguing thread in the book is the 
tension between scientists and journalists. 
Exoplanet discoveries are ripe for mis- 
interpretation and exaggeration. The media 
and the public want an exciting story even 
when there is none, and it is easy for a sci- 
entist to get caught up in the hype. Lem- 
onick describes how one astronomer felt 
he was induced by a journalist to say that 
“the chances for life” were “100 percent” 
on a new planet, the very existence of which
is now challenged. 

Mirror Earth captures the excitement of 
planet hunting and the ups and downs of 
discovery. Yet today, the quest for an Earth 
twin remains just out of reach. To identify 
a truly Earth-like world will require a more 
sophisticated space telescope than Kepler, 
to look for water and signs of life. Lemon- 
ick speculates on how that might happen. It 
may take decades to see how the story really 
unfolds. Stay tuned.


Mirror Earth: The 
Search for Our 
Planet’s Twin 
MICHAEL D. LEMONICK 
Walker: 2012. 304 pp. 
$26, £19.99 


Sara Seager is a professor of planetary 
science and a professor of physics at the 
Massachusetts Institute of Technology in 
Cambridge. 

e-mail: seager@mit.edu 


World in the Balance: The Historic Quest for an 
Absolute System of Measurement 

Robert P. Crease (W. W. Norton, 2012; $17.95) 
From weighing out food to telling the time, we 
have long sought to measure experience. Robert 
Crease traces the origins of such rules, which now 
seem “part of the contour of things”. (See Andrew 
Robinson’s review: Nature 478, 32-33; 2011.) 


The Abacus and the Cross 

Nancy Marie Brown (Basic Books, 2012; $16.99) 
The idea that anti-scientific superstition defined 
the Middle Ages is challenged by science writer 
Nancy Marie Brown in this life of ‘Scientist Pope’ 
Gerbert of Aurillac. Gerbert devised early forms 
of the computer and the planetarium, and his 
teaching helped spread science through Europe. 
PSEUDOSCIENCE


A fringe too far 


David Morrison finds contemporary echoes in a history of 
‘science wars’, from Velikovsky to Lysenko and beyond. 


From creationism and climate-change
denial to cold fusion, pseudoscience
pervades the media, often drowning
out real science. Michael Gordin’s scholarly
and highly readable The Pseudoscience Wars
uses Immanuel Velikovsky’s ‘cosmic catastrophe’
theory — once a hugely influential
recasting of ancient history — as the organizing
principle for a discussion of modern
pseudoscience and “science wars”. The book
also covers Lysenkoism and young-Earth
geology in depth.

Gordin, a science historian, asserts that
scientists apply the label of pseudoscience —
which he defines as “doctrines that are non-science
but pretend to be, or aspire to be, or
are simply mistaken for scientific” — only to
ideas whose public appeal makes them seem
threatening. His goal is to examine how scientists
try to deal with the issue of ‘outsiders’
and understand their thinking, and what light
the Velikovsky phenomenon sheds on that.

For much of the past half-century, Velikovsky’s
big idea both enjoyed public popularity
and drew the opprobrium of scientists.
A Russian immigrant to the United States,
Velikovsky was a psychiatrist by training, yet
thought of himself as a historian who used
the insights of psychoanalysis to interpret the
myths of early civilizations. He collected and
correlated myths from the first and second
millennia BC, many dealing with catastrophes
such as the Egyptian plagues described
in the Bible's Old Testament. Eventually, he
began to propose cosmic events that could
explain things that happened in the myths.

By the late 1940s, Velikovsky had convinced
himself that Earth had nearly collided with
Venus and Mars between 2000 and 800 BC.
This grand synthesis was published in 1950
as Worlds in Collision, followed by several
further books and dozens of articles. In spite
of scathing criticism from other scientists, he
never wavered from his conviction: a few
weeks before his death in 1979, he said: “No
one has disproved Worlds in Collision.”
The book became a best-seller. But Velikovsky
wanted to be recognized as a bona fide
scholar. He was stung by harsh reviews,
including pressure from prominent scientists
that resulted in the book being
dropped by Macmillan (which publishes
this journal), then the main publisher of
science textbooks in the United States, and
transferred to the trade publisher Doubleday.

The Pseudoscience Wars: Immanuel
Velikovsky and the Birth of the
Modern Fringe
MICHAEL D. GORDIN
University of Chicago Press: 2012. 304 pp.
£18.50, $29

Velikovsky sought scientific legitimacy
by courting individual scientists, especially 
Albert Einstein, who, like Velikovsky, lived 
in Princeton, New Jersey. Gordin discusses 
these efforts and the mainly hostile reactions 
to them. Scientists recognized that Velikov- 
sky’s planetary collisions, and his suggestion 
that electromagnetism rather than gravita- 
tion dominated planetary motions, were (in 
Einstein's term) “crazy”. Velikovsky was sim- 
ply not speaking the same language as the 
scientific community, yet his books were 
received uncritically by many of the public. 
If the story had ended there, he would 
quickly have retreated into oblivion. But in 
the 1970s, Velikovsky became a symbol of the 
US counterculture. His reissued books hit the 
best-seller lists again; students invited him to 
speak on college campuses, and he acquired 
a group of enthusiastic disciples. 
This assault from a crank baffled the 
scientific community. Gordin adds context 


How Old Is the Universe? 

David A. Weintraub (Princeton Univ. Press, 2012; 
$22.95) 

Astronomers do not believe the Universe is 13.7 
billion years old, writes David Weintraub; they 
know it. But how? Weintraub chronicles centuries 
of scientists’ painstaking work gauging the ages 
of planets and stars to arrive at the answer. 


Our Magnetic Earth: The Science of 
Geomagnetism 

Ronald T. Merrill (Univ. Chicago Press, 2012; 
$17.00) 

Earth scientist Ronald Merrill draws together 
topics relating to geomagnetism, from mammals 
using the magnetic field for sensing to what 
magnetized rocks reveal about Earth’s history. 
with the tale of the agronomist and
pseudoscientist Trofim Lysenko, whose 
theory of heredity, supported by Joseph 
Stalin, came close to destroying the sci- 
ence of genetics in the Soviet Union. One 
reason scientists so strongly opposed 
Velikovsky was the recent, chilling mem- 
ory of Lysenkoism. 

There are fascinating and alarming 
parallels between these outsider narratives 
and Christian creationists’ use of pseudo- 
scientific arguments to lend legitimacy to 
their cause. Henry Morris and John Whitcomb’s
1961 publication The Genesis Flood
(Presbyterian and Reformed Publishing) 
became the foundation of the ‘creation 
science’ movement. Like Velikovsky, these 
authors postulated a catastrophic history of 
Earth, reinterpreting all geology in terms 
of a single universal flood, as described in
chapters 6-11 of Genesis. They based their 
conclusions solely on a literal interpreta- 
tion of scripture, and rejected Velikovsky’s 
naturalist explanations. 

Pseudoscience that has the support of 
organized religion or economic interests 
tends to survive much longer than the work 
of a lone eccentric such as Velikovsky. In his
final pages, Gordin touches on a new phase
of pseudoscience, practised by a few rogue 
scientists themselves. Climate-change 
denialism is the prime example, in which a 
handful of researchers, allied with an effec- 
tive public-relations machine, are publicly 
challenging the scientific consensus that 
global warming is real and is attributable 
mainly to human consumption of fossil 
fuels. Deniers’ questioning of the climate 
data and their attacks on the integrity and 
motivations of the scientists involved were
exemplified by the e-mail-hacking ‘climate- 
gate’ scandal of 2009. This is perhaps the 
greatest threat from pseudoscience today. 

Velikovsky and Lysenko may be largely 
forgotten, but other forms of pseudo- 
science are flourishing — among both the 
public and many politicians. Gordin’s his- 
torical analysis of pseudoscience remains 
disturbingly relevant.


David Morrison is the director of the 
Carl Sagan Center for Study of Life in the 
Universe in Mountain View, California. 
e-mail: dmorrison@seti.org 


Encounters with the nonexistent


Dominic ffytche contemplates Oliver Sacks’ journey
through the past and future science of hallucinations.


Oliver Sacks’ Hallucinations opens
with something of an apology.
The neurologist explains that his
book will not focus on cutting-edge brain-imaging
studies of what happens during a
hallucination or what predisposes individuals
to having them; rather, it will concentrate
on first-person accounts, with the aim of
understanding what it is like to hallucinate.
It is as if Sacks worries that such an approach
is insufficiently scientific, in an era of functional
and structural neuroimaging.

Such concerns are unfounded. Imaging
studies have made significant contributions
to the field, but it is still Sacks’ clinical
territory — where the listening, archiving
and recognition of repeated patterns
of symptoms takes place — that sets the
research agenda.

Hallucinations
OLIVER SACKS
Picador/Knopf: 2012. 336/352 pp.
£18.99/$26.95

By analogy to eighteenth-century
catalogues of flora and fauna from different
corners of the world, Sacks structures his
book as a ‘natural history’ of hallucinations.
Chapter by chapter, he describes the various
clinical and physiological settings in which
people have hallucinations. They might >


The Collapse of Mechanism and the
Rise of Sensibility
Stephen Gaukroger (Oxford Univ. Press, 2012;
£25)
Historian of philosophy Stephen Gaukroger
charts how a sensory view of nature coincided
with the novel’s rise in 1680-1760. (See George
Rousseau’s review: Nature 470, 462-463; 2011.)


A Universe from Nothing: Why There Is
Something Rather than Nothing
Lawrence M. Krauss (Simon & Schuster, 2012;
£8.99)
The current thinking on multiverses, dark energy
and what is meant by ‘nothing’ is unpicked by
theoretical physicist Lawrence Krauss. (See Caleb
Scharf’s review: Nature 481, 440; 2012.)
> see indecipherable writing, or miniature
costumed figures (visual hallucinations); 
smell dill pickles (olfactory hallucinations); 
or hear incessant renditions of the song 
White Christmas (musical auditory hallu- 
cinations). 

Using descriptions from his patients, cor- 
respondents and published accounts, Sacks 
covers areas as diverse as eye disease, stroke, 
drugs (in poignant revelations from his own 
past) and bereavement, as well as bizarre 
perceptual experiences that defy simple 
classification. Such classificatory dilemmas 
include the case of Sarah, who saw herself 
from the ceiling of an operating theatre 
(in an ‘out of body experience’); and Ellen,
whose brother’s profile remained fixed 
in her vision for days (visual persever- 
ation). What all such experiences share 
is that the perceived object either isn't 
there or differs in some way from how 
others would perceive it. 

In the neurology clinic, hallucina- 
tions tend to be visual; in the psy- 
chiatric clinic, the overwhelming 
majority are auditory — specifically, 
voices. Sacks writes from a neurolo- 
gist’s perspective, so his catalogue of 
visual phenomena and other neuro- 
logical symptoms is comprehensive 
and scholarly. Hallucinated voices and 
other manifestations of schizophrenia 
and bipolar disorder he mentions 
only in passing. They require, he 
suggests, a book of their own. 

Hallucinations gives 
Sacks the opportunity 
to explore an area new 
to him: the paranormal. 
Before the twentieth cen- 
tury, hallucinations were often 
seen as belonging to the domain 
of spiritualism, ghost hunting, telepathy, 
crystal gazing and divination. The most 
detailed survey of hallucinations ever 
undertaken remains the International Cen- 
sus of Waking Hallucinations in the Sane, 
conducted in the 1890s for the Society for 
Psychical Research in London, to uncover 
evidence for communication from the dead. 

Chapter titles such as ‘Phantoms, 
Shadows, and Sensory Ghosts’ and ‘The
Haunted Mind’ remind us of these metaphysical
probings. Hallucinations and
other anomalous brain experiences, Sacks
suggests, lie behind psychical and para- 
normal phenomena, as well as fairy-tale 
and folklore descriptions of devils, witches, 
elves and leprechauns (or, more recently, 
alien encounters). 

Sacks’ book comes at a time of exciting 
developments in hallucination science. A 
key challenge to progress at present is the 
question of whether all types of hallucina- 
tion are caused by the same brain mecha- 
nism, or by several distinct ones. In Sacks’ 
natural-history terms, are hallucinations 
a single species of symptom or a family of 
species? It seems probable that the answer 
is a mixture of the two. Although they are
very different, hallucinations originating in
eye disease, hearing loss, amputation, sen- 
sory deprivation or stroke might be consid- 
ered a single species because, for each, loss 
of input to the brain triggers a sequence of 
neurophysiological changes that results in 
spontaneous neural firing and hallucina- 
tions. 

Another cause of hallucinations, which 
might be considered a second species, is
faulty efference signals. These signals come
from the motor system and spread through- 
out the brain to inform it of impending 
movement or action. In current psychiatric 
accounts of hallucinations, faulty efference 
signals are thought to cause a failure to rec- 
ognize thoughts, actions or inner speech 
as originating with oneself, so that they are 
perceived as spoken or controlled by others. 
Other hallucination theories are based 
on knowledge stored in the brain that influ- 
ences what we perceive through expectation 
or preoccupation. Sacks tells us of Marion, 
for example, who while grieving the loss of 
her husband, heard and saw him greeting 
her as she returned home — a ‘top-down’
influence that might be considered a 
third species of hallucination. 
Current research is seeking to 
establish whether these apparently 
different species can be reconciled in a
single overarching mechanism. What- 
ever the outcome, future classificatory 
schemes of hallucinations and related 
phenomena are likely to be very dif- 
ferent from those we have now. 
Hallucinations also take us to the 
edge of our understand- 
ing of perceptual con- 
sciousness by hinting 
at brain functions and 
processes that we did 
not even know were 
there. There is nothing in our 
models of object vision that 
explains Rosalie’s descrip- 
tions, in Hallucinations, 
of silent multitudes 
in elaborate East- 
ern dress, or Marlon 
seeing his home, in real- 
ity ordered and tidy, as a 
world of chaotic dilapidation. It 
is observations such as these, derived from 
the consulting room, that challenge the 
neuroscience of perception — and Sacks is 
leading the way.


Dominic ffytche is Clinical Senior 
Lecturer in Old Age Psychiatry at King’s 
College London. He specializes in visual 
hallucinations and related visual perceptual 
disorders. 

e-mail: dominic.ffytche@kcl.ac.uk


The Better Angels of Our Nature: Why Violence 
Has Decreased 

Steven Pinker (Penguin, 2012; $20) 

We are becoming less violent thanks to societal 
evolution, avers psychologist Steven Pinker, 
citing significant evidence showing that warfare 
and murder rates are falling. (See Martin Daly’s 
review: Nature 478, 453-454; 2011.) 


In Other Worlds: SF and the Human Imagination 
Margaret Atwood (Virago, 2012; £9.99) 

The creator of dystopian worlds in Oryx and 
Crake and The Year of the Flood looks to her own 
work and that of authors such as Kazuo Ishiguro 
and H. G. Wells in a collection of essays 
exploring science fiction. (See Q&A: Nature 478,
35; 2011.)
FOOD SECURITY


Growing trouble


Calestous Juma assesses a call for policy to feed the world 
at a time of squeezed production and soaring prices. 


We forget that part of the impetus
behind the Arab Spring was the
food price spikes of 2007-8. As,
globally, we continue to deal with the political
and diplomatic ramifications of that, another
price spike is projected — this time as a result
of serious recent drought in the United States, 
Latin America and South Asia. 

These events are key to assessing the pol- 
icy relevance of environmental analyst Lester 
Brown's Full Planet, Empty Plates. Brown's 
message is that the deteriorating world food 
situation has far-reaching geopolitical rami- 
fications that demand urgent policy action. 
Saving civilization, says Brown, needs to be a 
contact sport, “not a spectator sport”. 

The book offers some general indication 
of how that saving might happen — not 
least through galvanizing the international 
community. And it stresses the global and 


Is That a Fish in Your Ear? The Amazing 
Adventure of Translation 

David Bellos (Particular Books, 2012; £8.99) 
Culture, nationalism and semantics all feature 
as translator David Bellos gives both a history of 
the shifting meaning of translation and practical 
insight into the complexity it involves. (See Ellen 
Bialystok’s review: Nature 477, 536; 2011.) 


systematic nature of the challenges. But
the book is silent on the urgent need for an
institutional response that matches the
magnitude of the problem.

Brown argues that the biggest threat to
global stability is the potential for food
crises in developing countries. That
risk is exacerbated by population growth:
some 219,000 people are joining the dinner
table every night. Meanwhile, the rise in
affluence is driving up demand for
grain-intensive livestock and
poultry products — at a time when nearly

Full Planet, Empty 
Plates: The New 
Geopolitics of Food 
Scarcity 

LESTER BROWN 

W. W. Norton: 2012 
160 pp. $27.95, £20 
one-third of US grain output is being diverted
from food to fuel for cars. These challenges 
are compounded by degradation of farm- 
lands, water scarcity, climate change and the 
fact that grain yields are starting to plateau. 

According to Brown, the overall effect of 
these dynamics is a dangerous transition 
from a time of plenty to one of food scarcity. 
The result is a global rush for land, and “a 
new geopolitics of food”. Brown avers: “Food 
is the new oil. Land is the new gold.” Yet the
newness of this thesis is overstated: food has 
always been a geopolitical issue. The green 
revolution of the 1960s and 1970s, for exam- 
ple, was inspired by geopolitical considera- 
tions; one being the possibility of popular 
uprisings following famines in countries such 
as India and Mexico, as was well documented 
by John Perkins in Geopolitics and the Green 
Revolution (Oxford University Press, 1997). 

How does Brown envisage tackling the 
dilemmas? He asserts that conventional sup- 
ply-oriented policy measures such as offering 
farmers financial incentives and higher price 
support are simply not enough. Measures of 
demand form the centrepiece of the book’s 
policy response. They include stabilizing the 
global population, eradicating poverty, reduc- 
ing meat consumption and changing biofuels 
policies by curtailing the growth of the indus- 
try and switching to electric vehicles. 

On the face of it, Brown's proposals seem 
sensible. However, the book focuses on what 
needs to be done, but provides very little 
indication of how such policies might be 
implemented. As a result, there are few new 
insights on how the world can avert the “food 
breakdown”. Although poverty eradication is
a desirable goal, so far it has remained elusive, 
and there are no new ideas here on tackling it. 

Probably the weakest aspect of Full Planet, 
Empty Plates is its failure to clearly outline 
the institutional mechanisms through 
which to address the challenge. So far, the 
world has tried to deal with the issue largely 
through international bodies such as the 
United Nations. Their efficacy is now in 
question, but there is no discussion of alter- 
natives. And the solutions Brown cites derive 
from national efforts. What works for one 
country may not work for all. 

For example, he alludes to efforts by organi- 
zations including the World Bank and the 
United Nations Food and Agriculture > 


The Quest: Energy, Security, and the Remaking 
of the Modern World 

Daniel Yergin (Penguin, 2012; $20) 

Oil pools at the centre of Daniel Yergin’s analysis 
of the challenge to supply the world with energy. 
Also focal is his belief that the energy question 
should be integral to foreign policy. (See Vaclav 
Smil’s review: Nature 477, 403; 2011.) 
> Organization to work out principles
for governing land acquisitions in devel- 
oping countries. This has emerged as one 
of the most controversial aspects of Afri- 
can agriculture, with most of the land deals 
concentrated in countries such as Ethio- 
pia, Ghana, Liberia, Madagascar, Mozam- 
bique, South Sudan and Zambia. 

These are relatively poor countries that 
need enhanced capability in resource 
management. Simply offering new guidelines
won’t do it. Ethiopia, for example, has
a long-term vision to modernize its agriculture,
but Brown offers little indication
of how such countries could play a part. 

There have been previous predic- 
tions of food crises. A variety of creative 
responses — especially those made possi- 
ble by advances in science and technology 
— helped to forestall disaster and buy the 
international community time. The green 
revolution, by introducing inputs such as 
fertilizer and high-yielding crop varieties, 
boosted food production. Brown ignores 
technology, however, even though almost 
all of the measures he proposes — from 
the management of population to the 
expansion of prosperity and switching of 
energy sources — require the marshalling 
of human ingenuity. 

The green revolution offers another 
important insight: Latin America and 
Asia responded to the food crises by 
expanding local food-production capac- 
ity. A global network of agricultural 
research institutes was created under 
the Consultative Group on International 
Agricultural Research (CGIAR) to pro- 
mote such capacity. The CGIAR is prob- 
ably the most important geopolitical 
innovation ever created in response to 
food crises. The world will need to either 
strengthen it or offer an alternative. 

Full Planet, Empty Plates is a call to 
arms against the wolf at the door. Admitting
the presence of a wolf is one thing;
designing an attack strategy is another.
For that, we need to turn elsewhere.


Calestous Juma is professor of the 
practice of international development at 
Harvard Kennedy School, Cambridge, 
Massachusetts, USA. 

e-mail: calestous_juma@harvard.edu 


BIOTECHNOLOGY 


Random harvest 


Ian Wilmut assesses a critique of how genomics, 
bioethics and neuroscience are meeting their potential. 


The subtitle of this book, Bioscience’s
Promethean Promises, led me to
expect I would be reading about
the harm caused by biomedical research. 
Instead, I found a scathing account of the 
failure of recent projects in biology to pro- 
vide significant new knowledge. 

In Genes, Cells and Brains, their fifth book 
together, sociologist Hilary Rose and her 
husband, neuroscientist Steven Rose, con- 
sider in detail several disciplines that have 
become fashionable during the past 30 years. 
These include genomics, experiments on 
animals, biobanks, regenerative medicine 
and neuroscience. The duo also reviews 
mechanisms of evolution and informatics. 

The authors introduce each theme with 
a historical account of its scientific, ethical 
and sociological background, according 
to their views. Anyone who has read Love, 
Power and Knowledge: Towards a Feminist 
Transformation of the Sciences (Polity, 1994) 
by Hilary Rose and Alas Poor Darwin: Argu- 
ments Against Evolutionary Psychology (Jon- 
athan Cape, 2000), edited by both authors, 
will know broadly what to expect from 
their socialist and feminist perspectives. 
So there is criticism of modern academics 
being encouraged to patent their research 
or becoming involved in commercializing 
their work, and of pre-implantation genetic 
diagnosis. Whether or not you agree with the 
Roses’ perspectives, they provide thought- 
provoking and interesting contrasts to the 
secular, neo-liberal view that predominates 
at present. 

The authors describe why, in many cases, 
they believe that misunderstandings about 
the underlying biology inevitably led to 
projects failing to achieve their aims. In 
their view, for example, the gene-centric 
view of biology exaggerates the probabil- 
ity of finding linkages between genes and 


disease. They also forcefully criticize
today’s free-market capitalist global
economy, suggesting that the undesirable
social environment it creates often leads to
inappropriate organization of bioscience
projects, with, for example, serious risks
of personal data being released into the
public domain.

Genes, Cells and Brains: Bioscience’s Promethean Promises
Hilary Rose and Steven Rose
Verso: 2012. 336pp, £20, $26.95

They address their criticisms particularly
at the genome-wide association studies of 
human inherited disease that have sprung 
up since the human genome was first 
sequenced. In such studies, the genomes of 
populations in a community are analysed in 
a search for evidence that a specific form of 
a gene (allele) is associated with a high risk 
of the person having a serious illness. So far, 
these projects have failed to fulfil the unreal- 
istic expectations that were raised by public- 
ity from the research community. 


But Will the Planet Notice? How Smart 
Economics Can Save the World 

Gernot Wagner (Hill and Wang, 2012; $16) 

You can avoid meat, plastic bags and air miles, 
writes economist Gernot Wagner, but individual 
choices have no effect on the planet. Harnessing 
market forces to incentivize green behaviour is 
the key to confronting climate chaos, he argues. 


Alone Together: Why We Expect More from 
Technology and Less from Each Other 

Sherry Turkle (Basic Books, 2012; $16.99) 
Computers are changing the way we interact, 
warns social scientist Sherry Turkle, leading
us to view digital connections as a substitute
for intimacy. Many people are ‘always online’;
others have never known life without computers. 
I must declare an interest: I am a member
of the group that the Roses hold largely
responsible for the failure of the “New Biology”
to provide important new knowledge.
I am a 68-year-old Caucasian male who
has worked in biotechnology for the past 
30 years; I am also one of the inventors 
named on several patents. At one point I 
was an employee and shareholder of Geron 
in Menlo Park, California, a biopharmaceu- 
tical company that is criticized in one section 
of the book. 

The authors repeatedly raise two general 
concerns about biotechnology. They criti- 
cize the extent to which many biologists, 
adopting a gene-centric view, attribute 
aspects of human behaviour — including 
sexuality, an inclination to shoplift and 
altruism — to the influence of specific 
genes. They make the point that humans 
and fruitflies have a very similar number of 
genes, but very different levels of complex- 
ity, with the implication that genes cannot 
account for everything. 

In this context, it is surprising to see so 
little mention of the apparent role of the 
different populations of RNA molecules in 
generating complex functions. Reports of 
the ENCODE project, which is providing 
the first detailed map of these regulators, 
were published in September. However, 
the general concept that non-coding RNA
sequences have a vital regulatory function is 
more than a decade old (J. S. Mattick EMBO 
Rep. 2, 986-991; 2001). 

The authors’ second beef is that commer- 
cialization now so dominates science that it 
distorts research management and priori- 
ties. They assert, for example, that succes- 
sive British governments have supported 
research with human embryonic stem cells 
because of the potential for wealth creation. 
To those of us involved in the research, these 
unique cells should be thoroughly investi- 
gated because of the potential to treat debili- 
tating degenerative diseases for which there 
is currently no treatment. 

Without wishing to ignore their charge, I 
would balance the discussion by mention- 
ing the private sector's very large investment 
in biotechnology. The authors surely do not 
imagine that governments would provide 
such funding if private investment ceased? 

What of their overall comments about 
genomics? No doubt many of those who 
were involved in the research are as disap- 
pointed as the authors by the limited clini- 
cal benefit so far. Experience shows that it is 
difficult to predict the outcome of research 
and that it is hard to estimate the use that can 
be made of new knowledge. However, thou- 
sands of academic and clinical laboratories 
are making daily use of information about 
the human genome and of the methods that 
have been developed during the past decade 
for measuring gene expression in a small 
sample of cells. 

I share the authors’ disappointment at the 
rate of progress. But I am convinced that, in 
time, these fields will provide treatments for 
diseases such as motor neuron disease, mul- 
tiple sclerosis, Parkinson's disease, diabetes 
and some forms of heart disease — eventu- 
ally having a beneficial effect comparable 
to the revolutionary control of infectious 
diseases achieved during the past century. 
Sadly for those currently afflicted, it seems 
that it will take several decades for this to 
happen.


Ian Wilmut is professor emeritus in the 
University of Edinburgh's Medical Research 
Council Centre for Regenerative Medicine, 
of which he was the founding director. 
e-mail: iwilmut@staffmail.ed.ac.uk 


The Magic of Reality 

Richard Dawkins and Dave McKean (Doubleday)

In this edition of his illustrated bestseller for 
young people, eminent biologist Richard 
Dawkins aims to reveal scientific truth as more 
magical than myth, while Dave McKean’s 
photoreal images viscerally evoke the wonder. 
The Fair Society

Peter Corning (Univ. Chicago Press, 2012; $17) 
A sense of fairness is innate in humans, argues
Peter Corning. He cites examples such as public 
outcry at discrimination cases and divorce- 
court wrangling as evidence that society is 
underpinned by notions of reciprocity, justice 
and what is fair. 
Reallocate funds to
boost performance 


Last month’s report by the 
Council of Canadian Academies, 
prepared at the government's 
request, reveals that Canada’s 
research output and impact rank 
seventh and sixth in the world, 
respectively (see go.nature.com/ 
pyvwlk). Shouldn't the country 
be performing better, given its 
sustained investment in research 
and development (R&D)? 

Gross domestic expenditure 
on R&D by the government 
and higher-education sectors 
was 0.77% of gross domestic 
product (GDP) in 2008 and has 
been maintained throughout 
the global financial crisis (0.79% 
in 2010). Although Canada 
performs well given its small 
population, its GDP is relatively 
high because of natural-resource 
exports, so these figures may be 
underestimates. Its 2008 R&D 
investments exceeded those of 
Japan and of every European 
Union member except Finland 
and France (see go.nature. 
com/59dg8f). 

Canadian academia can 
improve returns on this public 
investment. Stimulating 
competition and responsibility 
for delivering research output 
and impact would support the 
government's commitment to 
fiscal accountability. For example, 
it could assess researchers every 
5-10 years and reallocate a small 
amount of existing public funding 
to the highest performers. These 
measures could bring Canada 
in line with other developed 
countries, such as the United 
Kingdom and Australia. 

Andrew J. Tanentzap York 
University, Toronto, Canada. 
atanzap@yorku.ca 


Social insight rings 
true 125 years on 


David Rand and colleagues 
pose the question: “Are we 
intuitively cooperative, with 
reflection upon the logic of 
self-interest causing us to rein 


in our cooperative urges and 
instead act selfishly?” (Nature 
489, 427-430; 2012). In 
answering affirmatively, they 
confirm an observation made 
125 years ago by the Republican 
Senator for California, Leland 
Stanford. 

Stanford had introduced a bill 
to foster worker cooperatives 
in 1886, and was interviewed 
by The Cincinnati Enquirer. 
When Stanford broached the 
issue of women's rights, the 
reporter asked him, “Do you 
not think women will go off 
on sentimental issues if they 
undertake the business of 
government and break up the 
organizations by which men 
work out large ends?” Stanford 
replied, “Oh! It is not sentiment 
that we have to fear so much as 
we suppose. A man’s sentiments 
are generally just and right, 
while it is second selfish thought 
which makes him trim and 
adopt some other view. The best 
reforms are worked out when 
sentiment operates, as it does in 
women, with the indignation of 
righteousness.” (See go.nature. 
com/i6b91d.) 

Stanford’s discernment of 
rapid, pro-social sentiment 
and slower, selfish calculation 
is but one part of a larger body 
of thought on the potentials for 
cooperation in the economy, 
which we may find worth 
revisiting today. 
Lee Altenberg Maui, Hawaii, 
USA. 
altenber@hawaii.edu 


Beit fellowships 
forge a Nobel link 


The Beit Memorial Fellowship 
scheme, the first specifically 
to support young UK medical 
researchers, has made a
remarkable contribution to 
British biomedical science 
over the past 100 years. John 
Gurdon’s Nobel prize this year 
(see go.nature.com/yo3cxf) is 
the seventh to be awarded to a
former Beit fellow. 

Other Nobel laureates who 


once held Beit fellowships were 
Alexander Todd (chemistry, 
1957), Fred Sanger (chemistry, 
1958 and 1980), Macfarlane 
Burnet (physiology or 
medicine, 1960), Bernard Katz 
(physiology or medicine, 1970) 
and Tim Hunt (physiology
or medicine, 2001). Some
10% of Beit fellows have
gone on to become fellows of
the Royal Society, including 
Marjory Stephenson (1914 Beit 
fellowship), who became one 
of the first two female Royal 
Society fellows in 1945. 

The scheme merged with the 
Wellcome Trust Intermediate 
Fellowship in 2009 to create the 
Wellcome-Beit Prize Fellowships 
(see go.nature.com/i2tofa). 
Andrew McMichael Weatherall 
Institute of Molecular Medicine, 
University of Oxford, UK. 
andrew.mcmichael@imm.ox.ac.uk 
Cyril Chantler UCL Partners 
Academic Health Science 
Partnership, London. 


Timely reminder to 
return to the Moon 


The death of astronaut Neil 
Armstrong (see Nature 489, 
368; 2012) reminds us that,
43 years after he set foot on the
Moon, the satellite is still a hot 
target. 

It is the best and most 
accessible place in the Solar 
System for robotic and human 
assets to address fundamentally 
important scientific questions 
while expanding our 
technological capabilities. 

Most of the Moon remains 
unexplored: we have never been 
to its far side, or to its polar 
regions. 

In the last year of his life, 
Armstrong told a committee 
appointed by the US Congress 
that the larger human- 
exploration goals lie beyond low 
Earth orbit, namely: “Luna, the 
lunar Lagrangian points, Mars 
and its natural satellites, and 
near-Earth objects including 
meteoroids, comets and 
asteroids”. He went on: “Last year 


I testified to this committee on 
the rationale for selecting Luna 
and its environs as the preferred 
initial option for America’s 
exploration beyond Earth orbit. 
All that I have learned in the 
past year has just reinforced that 
opinion.” 

Let’s hear and act on those 
words. Rather than exploring 
new options, we should begin 
building the hardware, selecting 
the landing sites, training the 
crews and planning the details 
of missions that take us to the 
Moon and a future of new
discoveries. 

David A. Kring Universities 
Space Research Association, 
Lunar and Planetary Institute, 
Houston, Texas, USA. 
kring@lpi.usra.edu 

Jack O. Burns University of 
Colorado Boulder, Colorado, USA. 
D. Ben J. Bussey Johns Hopkins 
University Applied Physics 
Laboratory, Laurel, Maryland, 
USA. 
Bureaucracy bypass
let research flourish 


The remarkable success of the 
UK Laboratory of Molecular 
Biology in Cambridge
(W. Bynum Nature 490, 31-32;
2012) might be explained in the 
words of Max Perutz. When 
asked what makes creative 
research, he would say: “No 
politics, no committees, no 
referees, just talented highly 
motivated people” (G. Radda 
Nature Med. 8, 205; 2002). 

Four of the greatest discoveries 
of the twentieth century — the 
structure of the atom, quantum 
mechanics, the theory of relativity 
and the structure of DNA — were 
made without project reviewers or 
grant-giving agencies (B. Gal-Or 
Cosmology, Physics, and 
Philosophy p. 493; Springer, 1983). 
Min-Liang Wong College of 
Veterinary Medicine, National 
Chung-Hsing University, 
Taichung, Taiwan. 
mlwong@dragon.nchu.edu.tw 
OBITUARY


Louise Johnson 


(1940-2012) 


Biophysicist who helped to establish the field of structural biology. 


Louise Johnson transformed our understanding
of how complex enzymes and
other proteins work. By combining her
in-depth knowledge of X-ray crystallogra- 
phy with a long-standing interest in bio- 
chemistry, she helped to launch structural 
biology as a new discipline. 

Johnson, who died on 
25 September, was born 
on 26 September 1940 
in Worcester, UK. She 
attended Wimbledon High 
School for Girls in Lon- 
don and then completed a 
degree in physics at Univer- 
sity College London. 

In 1962, Johnson started 
a PhD at London’s Royal 
Institution — working 
under David Phillips, a pioneer
of protein crystallography.
There she contributed
to studies on lysozyme, an 
enzyme that is abundant in 
various substances, such as 
tears and egg whites, and 
which breaks down the cell 
walls of bacteria. Even at 
this early stage in her career, 
Johnson was not interested 
in just working out struc- 
tures, but wanted to use 
an understanding of structure to uncover 
fundamental biochemical mechanisms. 

By firing X-rays at the crystallized pro- 
tein and studying the angles and intensities 
of the diffracted beams, Phillips estab- 
lished lysozyme’s structure in 1965. This 
was the second protein structure to be 
solved by X-ray crystallography (the first 
was myoglobin, found in muscle). John- 
son’s contribution was to work out what 
lysozyme looked like when it was bound 
to N-acetyl-glucosamine, a component of 
bacterial cell walls. Johnson and Phillips's 
work led to the first structural evidence 
that a substrate slots into an enzyme, much 
as a key fits into a lock. 

After her PhD, Johnson spent a year at 
Yale University in New Haven, Connecti- 
cut, where she worked as a postdoc with bio- 
physicist Fred Richards. Here she uncovered 
the structure of the enzyme ribonuclease S, 
showing that X-ray crystallography could be 
applied to proteins that were more complex 
than lysozyme. In 1967, she returned to the 
United Kingdom and rejoined Phillips in the 


Laboratory of Molecular Biophysics at the 
University of Oxford. 

I first encountered Johnson about ten 
years later. I was a first-year undergradu- 
ate at Oxford studying biochemistry, and 
my tutor sent me to her to learn the basics 



of how protein structures are determined. 
Instead of providing me with a standard 
textbook description, Johnson threw me in 
at the deep end. I was soon working through 
Fourier transforms — mathematical descrip- 
tions of what happens when X-rays are 
scattered by protein molecules. 

In 1990, Johnson was made the David 
Phillips Professor of Molecular Biophysics, 
and she remained in this post until her 
retirement. By this time, the Laboratory 
of Molecular Biophysics had become part 
of Oxford’s biochemistry department. 

From 2003, Johnson combined her 
position at Oxford with leading the develop- 
ment of the UK national synchrotron, the 
Diamond Light Source near Didcot. The 
largest science facility to have been built 
in the country for more than 40 years, the 
synchrotron produces beams of light that 
can be used to investigate the structure and 
properties of materials including proteins. I 
recall accompanying her (both of us wearing 
hard hats and boots) as she showed me the 
nascent facility with evident pride. 
Johnson’s many achievements included
establishing the structure of a large and 
complex enzyme called glycogen phos- 
phorylase. Present in muscle, this enzyme 
turns inert glycogen into the sugar needed 
to power physical activity. Johnson showed 
how the addition or removal 
of phosphate groups from 
the protein regulates its 
activity. (Phosphorylation 
has since turned out to be a
key form of regulation in all 
sorts of cellular processes.) 
She subsequently carried 
out a set of groundbreak- 
ing studies on proteins that 
have key roles in the regula- 
tion of cell division. 

In all this work, Johnson 
demonstrated that X-ray 
crystallography could reveal 
detailed catalytic and regulatory
mechanisms, and could
potentially unmask how
large proteins work as
complex biochemical machines.
Her 1976 book Protein Crys- 
tallography (co-authored 
with Tom Blundell) was for 
many years the classic text- 
book on the topic. 

Johnson received many 
honours in recognition of her work, includ- 
ing being made a fellow of the British Royal 
Society in 1990 and a foreign associate of the
US National Academy of Sciences in 2011. 

Louise was very supportive of young 
scientists, particularly women. Married, 
with two children, to physicist and Nobel 
laureate Abdus Salam, she understood the 
challenges of juggling a scientific career 
with looking after a family. A quiet and kind 
person, she directed her students and fellow 
researchers carefully but gently. The harsh- 
est comment I remember receiving from 
her — on my hastily written grant proposal 
— was “perhaps a little nebulous”. She was a 
source of inspiration to all around her, and 
will be greatly missed.


Mark Sansom is the David Phillips 
Professor of Molecular Biophysics in the 
Department of Biochemistry, University 
of Oxford, Oxford OX1 3QU, UK. He was 
Louise Johnson’s graduate student from
1979 to 1983. 
WORK EXPERIENCE


Stepping stones


Internships can help scientist job seekers to get a head start
on the competition.


BY AMANDA MASCARELLI


In late 2009, Mark Brown was at the end of
a six-year postdoctoral position in parasitology
at the University of California, Los
Angeles. Having completed a PhD in molecular
and cellular pharmacology, he was primed
for a position in academia. But he hungered
to move into industry. “I really wanted to take
what I’d learned and apply it, and I felt that
academia really didn’t offer me that option
whereas industry did,” says Brown.

But the economy was in trouble and industry
jobs in research and development were disappearing.
Prospects looked grim. Brown learned
about the Postdoc Professional Masters (PPM)
at the Keck Graduate Institute of Applied Life
Sciences in Claremont, California, one of the
growing number of programmes that aim to
provide practical, hands-on business savvy to
graduates with PhDs and medical doctorates
who want to enter the biosciences industry (see
Nature 469, 569; 2011).

Before he started the course, Brown was 
keenly aware that he faced stiff competition 
from more-experienced job seekers. “It was 
going to be a hurdle to make that jump,” he
says. “I did have a lot of technical skills, but 
I felt I didn’t have the experience that indus- 
try might want.” The two-semester PPM pro- 
gramme encourages students to participate in 
industry-sponsored internships, and Brown 
landed a placement with Claremont BioSolu- 
tions, a small company that develops research 
tools for scientists, based in Upland, Califor- 
nia. The internship taught him about market- 
ing strategies, developing a product and the 
culture of a start-up company. After Brown 
graduated from Keck in 2010, Claremont Bio- 
Solutions hired him as a senior scientist. 


US researchers trained on the academic 
track are increasingly pursuing job opportuni- 
ties outside academia. The US National Science 
Foundation’s 2010 Survey of Earned Doctorates 
found that only just over half of new, employed 
US PhD recipients across all fields hold 
research positions at academic institutions. 

Yet conventional science and engineering 
PhD programmes in the United States don't 
always provide the practical experiences and 
skills needed to make a transition to industry. 
The flagging economy and increased competition
for fewer jobs have made preparation for
the job market even more crucial. Internships
— whether offered as part of a graduate degree 
or pursued independently — can provide nec- 
essary skills, look good on a CV and offer a 
taste of the world outside academia. 

More and more young US scientists are pur- 
suing internships and cross-training opportu- 
nities that will give them an edge. Outside the 
United States, they are already common: in 
Germany, for example, internships and applied 
industry experience are often integral to graduate
training (see ‘Part of the package’). Many
employers and educational institutions in the 
United Kingdom recognize the value of intern- 
ships or placements for graduate students, notes 
Janet Metcalfe, head of Vitae, a research-career 
advisory organization in Cambridge, UK. 


NOT-SO-ACADEMIC HURDLES 

Internships take various guises. But they gen- 
erally last from two to six months, often over 
the summer. Graduate or postgraduate interns 
are usually paid salaries comparable to gradu- 
ate stipends and postdoc wages. 

Academic programmes are increasingly ena- 
bling students to gain practical work experience 
through fellowships, interdisciplinary collabo- 
rations, traineeships and other forms of profes- 
sional development. And internships may be 
directly related to a student's research, or serve 
as segues to opportunities in areas such as com- 
munications, venture capital, law, clinical-trial 
management or intellectual property. 

But even as awareness grows about non- 
academic tracks, many institutions and faculty 
members have not adjusted. Graduate advisers 
might not support internships that take time 
away from the student’s primary research — 
grant money is limited, and PhDs already take 
a long time to complete. Doctoral programmes,
especially in the life sciences, don’t typically 
encourage internships, says Jung Choi, direc- 
tor of the bioinformatics master’s programme 
at the Georgia Institute of Technology in
Atlanta. “There are all kinds of disincentives
for both faculty and students,” he adds.

Sudipto Guha, a chemical-engineering PhD 
student at the University of Illinois at Urbana- 
Champaign, found that there was little prece- 
dent among his peers for seeking an internship. 
“It’s a rare case that you would find an adviser
who would let you do an internship in gradu- 
ate school,” he says. Yet Guha struck lucky: his 
adviser did support him, helping him to seek 
out opportunities and putting him in touch 
with companies. Guha networked aggressively, 
introducing himself to company representa- 
tives regardless of whether their firm offered a 
formal internship programme. This summer, 
he completed an internship at Brady Corpora- 
tion in Milwaukee, Wisconsin, which develops 
identification solutions such as barcode labels 
and safety markings. 


ALL IN THE PLAN 

Some US institutions have formally recognized 
the need for internships and other practical 
experience. In 2008, after collecting survey 
results suggesting that many students were 
interested in opportunities outside academia 
but lacked confidence in how to pursue them, 
the school of medicine at the University of 
California, San Francisco (UCSF), teamed up 
with other UCSF basic and biomedical science 
graduate departments to launch the Graduate 
Student Internships for Career Exploration 
Program (GSICE). 

“Our strategy was to convey the realities of 
where our graduate students are going in terms 
of careers, what our graduate students are inter- 
ested in, and to remember that UCSF has the 
responsibility to train our students for fulfilling 
careers,” says Alexandra Schnoes, GSICE coordinator
and one of two people who spearheaded
the effort. Between January and March each 
year, students complete a series of workshops 


designed to help them to pinpoint the career 
choices most suitable for them. They can then 
pursue internships either during their graduate 
programmes (after passing the programmes’ 
qualifying exams and receiving approval from 
their advisers) or immediately after finishing. 
Bethann Hromatka, one of the first gradu- 
ate students to benefit from the internship 
programme, parlayed her placement into a 
full-time job. By the end of her PhD in bio- 
medical sciences at UCSF, Hromatka knew 
that she was not interested in a postdoc or aca- 
demia in general. “I was getting burnt out on 
the bench,” she recalls. “In my last year I had 
that conversation with my adviser, and at the 
end of the day she understood.” In 2011, after
defending her PhD, she wrapped up her
lab work on a Sunday evening. The next
morning, she began a two-month internship
at 23andMe, a personal-genetics
start-up based in Mountain View, California.
Hromatka’s internship consisted
mainly of researching and writing genetic
health reports for the company’s website,
and last November, 23andMe hired her
as a full-time health-content scientist,
with the same core responsibilities.

For those, like Hromatka, who have 
grown weary of bench work, internships can 
provide inroads into various career paths 
outside research and development. The Uni- 
versity of North Carolina at Chapel Hill offers 


GERMANY 


Part of the package 


Some US institutions are only now starting 
to incorporate non-academic career paths 
into graduate training, but at many German 
institutions, that emphasis is integral to 
doctoral training. In the past decade, such 
efforts have received an extra boost. 

In 2005, the government launched the 
German Excellence Initiative to support 
new strategies for research and graduate 
training (see Nature 487, 519-521; 2012). 
It invested around €2.1 billion (US$2.8 
billion) in the programme's first five years, 
and has now renewed it with another 
€2.5 billion for the next five years. Both 
phases have funded ‘research clusters’ 
and graduate schools at select institutions 
to foster international collaborations and 


internships across disciplines. 

That is part of a tradition of practical 
science-career planning in Germany. At 
the Technical University of Munich, for 
example, international internships are 
mandatory for all doctoral candidates and 
are backed with financial support from the 
university. The emphasis is on training for 
careers both inside and outside academia, 
says Ernst Rank, director of the institution’s 
graduate school. He points to surveys 
showing that some 80% of PhD students 
in Germany do not end up in academic 
positions. “Not one of our PhD students 
who graduates from our university would 
consider him or herself failed if they don’t 
make an academic career,” says Rank. A.M.


572 | NATURE | VOL 490 | 25 OCTOBER 2012 


© 2012 Macmillan Publishers Limited. All rights reserved 




a technology-transfer internship that gives 
graduate students and postdocs the opportu- 
nity to learn about intellectual property, pat- 
ent filings, licence agreements and technology 
commercialization through eight-month 
internships in the university's Office of Tech- 
nology Development. Interns commit 8-12 
hours per week to the training programme, 
which is carried out during the academic year, 
alongside the students’ main research. 

Cato Research, a clinical-research organiza- 
tion in Durham, North Carolina, runs a fel- 
lows programme offering doctoral students 
a one-year, full-time internship in clinical 
research and drug development. It acts as a 
sort of alternative postdoc and is becoming a 
popular choice, says Patrick Brandt, director 
of the office of science, training and diversity 
at the University of North Carolina, noting 
that in the past two years, an increasing num- 
ber of students have come to him asking about 
fellowships in clinical-trial management. 

At Stanford University School of Medicine 
in California, an internship programme for 
life-sciences PhD students is set to begin in 
2013. Career-centre advisers will help students 
to seek out 10-week placements in fields such 
as biopharmaceuticals, venture capital and 
finance, consulting, law and journalism. 

Internships don’t usually lead directly to 
full-time jobs at the host company or intern- 
ship site, says Terri O’Brien, associate dean 
for research strategy at the UCSF school of 
medicine. GSICE coordinators warn their 
students not to expect an immediate job 
offer, but they also point out that internships 
greatly improve the overall odds of landing 
a job. On the basis of preliminary data, the 
coordinators have found that more than 90% 
of their students who have done an internship
successfully move on to their chosen
career path after graduation, compared with 
60% of those who have not. But most impor- 
tant are the insights into potential career 
tracks — and the confidence to pursue them. 
“We don't define success in an internship as 
meaning they get a job in that field,” says
O’Brien. “They may go through the GSICE 
programme's career assessment and training 
and realize they want another field. We're 
really trying to impact the career-decision 
process and the point at which it happens.” 


ANOTHER DEGREE? 

Pursuing a further degree may help students 
to build the practical experience needed to 
move into industry or other non-academic 
career paths. This is the explicit purpose of 
the Professional Science Master’s (PSM) and 
PPM programmes. Increasingly popular, these 
degrees generally involve up to two years of 
academic training and a professional com- 
ponent such as an internship or other type of 
cross-training opportunity in business, com- 
munications or regulatory affairs. The effect is 
“dramatic”, says James Sterling, vice-president
for academic affairs at Keck. “PSM graduates
have a foot in the door at these companies,” 
he says, noting that each year, some interns 
do go on to get jobs at the same firm. Sterling 
says that 95% of graduates from Keck’s PSM 
and PPM programmes find employment in 
industry within eight months of graduation 
— although sample sizes are small for the 
PPM, which is much newer. Keck PSM grad- 
uates usually have starting salaries of about 
US$70,000, and Keck PPM graduates about 
$80,000. 

“The internship definitely sharpened my
focus,” says Neha Gupta, a bioinformatics
analyst at the J. Craig Venter Institute in
Rockville, Maryland, who earned a PSM in
bioinformatics at the Georgia Institute of
Technology in 2010 and did an internship at
Merck Research Laboratories in Rahway, New
Jersey, the same year. “I wasn’t sure
before if I wanted to go into a PhD programme
or get a job. But after the internship I decided I
definitely wanted to work in industry first,” she
says. The Venter institute hired Gupta last year.

“Professional Science Master’s graduates have
a foot in the door at companies.”
James Sterling

Internships and other unconventional 
training opportunities complement, rather 
than detract from, students’ research, says 
Richard Linton, National Science Foundation 
dean-in-residence at the Council of Graduate 
Schools in Washington DC, a national organi- 
zation that promotes graduate education and 
research. Students who build industry con- 
nections will often create valuable collabora- 
tions, bring new expertise to their academic 
labs and gain access to facilities that reinforce 
their projects, he says. “Data suggest that these 
non-traditional tracks actually elevate student 
productivity and give them more motivation 
to complete their degrees and move forward 
into careers,” says Linton. “There’s much evidence
that this is an enhancement, not a barrier,
to degree completion.”


Amanda Mascarelli is a freelance writer 
based in Denver, Colorado. 


CLARIFICATION 

In the article ‘Turbulent times’ (Nature 
488, 685-688; 2012), we referred to 
research conducted by Ruth Miller. 

The work was done while she was at the 
Department of Social Studies of Science at 
the University of Vienna, in a team headed 
by Ulrike Felt. 

EMERGING NATIONS
Scientists wanted 


Researchers are in demand in east Asia,
Latin America and southern Africa
— regions not considered traditional
scientific powerhouses — where doctoral-degree
holders tend to leave academia
for government or the private sector,
says a report. CODOC — Cooperation on
Doctoral Education Between Africa, Asia, 
Latin America and Europe, published on 
4 October by the European University 
Association (EUA) in Brussels, examined 
international collaboration across fields 
at universities to assess efforts to build 
research capacity. It documents ample 
international collaboration at universities 
in east Asia and an increasing number in 
Brazil and Mexico, but notes the need for 
more research investment in southern 
Africa to expand such efforts. Thomas 
Jorgensen, head of the EUA’s council for
doctoral education and author of the 
report, says that increasing collaboration 
hasn't made it any easier to retain 
academic faculty members at universities 
in the three emerging regions. But early- 
career scientists who are willing to leave 
their home countries can find posts that 
match their research specialities, he says. 
“The need for early-stage researchers is 
desperately there,” adds Jorgensen. 


EUROPE 
Misconduct policy 


The European Research Council (ERC) 
has issued a strategy for identifying and 
addressing scientific misconduct — the 
first such policy on the European level. 
Under the strategy, released on 9 October, 
host institutions are responsible for 
detecting, investigating and adjudicating 
on suspected breaches of research 
integrity or misconduct by ERC-funded 
researchers or applicants for ERC
grants. But the research council, based
in Brussels, will also take action once
institutions confirm cases of misconduct 
or questionable research practices: among 
other steps, it may notify the European 
Commission and the commission's anti- 
fraud office, hold a hearing, suspend or 
terminate grants or refuse further grant 
proposals. Since the ERC was launched in 
2007, it has issued some 3,000 grants and 
handled between 10 and 20 allegations
of misconduct, says ERC vice-president
Pavel Exner. On 17 October, the 
InterAcademy Council in Amsterdam and 
the InterAcademy Panel in Trieste, Italy, 
released Responsible Conduct in the Global 
Research Enterprise: A Policy Report to 
promote research integrity. 

GLASS FUTURE


BY DEBORAH WALKER 


The waitress seems reluctant to come
over, pretending not to see us, even
though I’ve tried to catch her eye
several times. We’d ordered our omelettes
40 minutes ago. How long does it take to 
crack a few eggs into a hot pan? 

“Do you think she’s post- 
human?” I whisper to my hus- 
band. She looks too good to be 
real. 

Caleb glances over. “Maybe. 
She's very pretty, but mods are 
so subtle, it’s difficult to see 
who's human and who’s not.” 

I wonder what such an 
attractive looking woman is 
doing working in a low-rent 
place like this, a greasy-spoon 
cafe in a habitat on the edge of 
Rhea. 

We'd booked into the habi- 
tat’s motel last night. It reeked 
of overenthusiastic, grandiose 
plans for the future that would 
never come true. At dinner, I’d
watched the motel’s guests. I
knew them, their small-time
liaisons and their wild plans. They didn’t
want much, just enough to be able to turn
up on their home habitat and impress the
ones who stayed behind, impress the ones
who said they’d never amount to anything.
They all ended up here, or someplace like
it, scrabbling for success, trying to make a
splash in an over-crowded system. This was
a place for people who’d never escape the
gravity well of their own failures. 

It was a sad place to end a marriage. 

“Is she ever going to come over?” I ask.

Caleb says: “I see that we will get the omelettes.
They'll be... disappointing.”

I smile. Caleb has a sense of humour about
his gift. Even now, when he knows what I’m
about to do, he still keeps cracking jokes.

I take a deep breath and say: “I want a
divorce.” I wait a moment to see if he’s going
to make things easier on me. He doesn't say
anything. I don't blame him. “I’m so sorry,
Caleb.”

“So am I.” He stares out of the window.
“We're on opposite sides of the reflection,
Alice. You knew that when you married
me.”

A time to reflect.

I look at his reflection in the metal glass
window. Caleb was a designer baby. A person
designed for space. The multiple copies
of his genome in each cell protect him
against ionizing radiation. But modding is 
always erratic. There's no way to predict how 
changes to the genome will affect the body 
— or the mind. Multiple-genome people, like 
Caleb, develop unusual connections in their
brains. Precognition. They remember their
future. And all of them are unable to pass the 
mirror test. They can see their reflections, but 
they can’t recognize themselves. Caleb hasn't 
got the self-awareness that most human 
babies develop at 18 months. That used to 
fascinate me, that lack of self. It seemed so 
strange, so exotic; now I find it sad. When 
love turns to pity, it’s time to end the relation- 
ship. Caleb didn't deserve my pity. 

I look beyond Caleb’s reflection to the 
habitat’s garden. Gardens don't thrive in 
space. The light collected from the solar foils 
and retransmitted to the plants is wrong. 
Earth plants either wither and die or they 
go wild. The habitat’s garden was overgrown 
and mutated. Swathes of honeysuckle, with 
enormous monstrous blooms, smothered 
everything. “It’s a pretty lousy garden,” 

“All these mutants should be cut away,” 
says Caleb. “I’m designing Zen gardens 
for the Oort habitats, swirls of pebbles, low 
maintenance.” A heartbeat later, he says:
“Why do you want a divorce, Alice?” 

He was going to make me say everything. 
“I’ve met somebody else, while you were
working on the Oort Cloud project.” Caleb’s
an architect, very much in demand in the
ongoing push of colonization. 

“Did you?” The note of surprise in his 
voice is convincing. Caleb’s good at pretend- 
ing to be something other than what he is. 
Every moment he swims in the seas of his 
future. Even when he met me, he must have 
known that one day we’d be here. Poor Caleb.
No wonder most precogs end 
up in hospital, overburdened 
by the nature of their gifts, or 
more specifically, overwhelmed 
by the fact that they're unable to 
change anything they see. “And 
you love him?” 

“I do. I'm going to move in 
with him. I’m sorry, Caleb.” 

“I know.”

The waitress comes over. 
She places two plates of greasy 
omelette on the table. She looks 
at Caleb, her violet eyes wid- 
ening in recognition. Caleb's 
famous. There aren't too many 
functioning precogs in the 
system. Every now and again, 
someone will put out a docu- 
mentary about him, usually 
spurious, about how he’s refus- 
ing to use his precognition to help people. 
It doesn’t work like that. The future's set. 
No amount of foreknowledge will change 
anything. 

“Thank you,” I say, trying to dismiss her.
Just because I don’t want him, doesn’t mean 
that I want anybody else to have him. 

The waitress lingers at a nearby table, 
straightening the place settings, wonder- 
ing how she can attract him, thinking that 
a knowledge of her future might bring her 
an advantage — just like I did when I met 
Caleb. She’s looking for her future, wanting 
to use Caleb, not realizing that the only thing 
we, on this side of the mirror, will ever have 
are reflections. 

“We'll keep in touch, Caleb,” I say.

“No, we won't. Goodbye, Alice.” He leaves 
the table, walks over to the waitress. He says 
something that makes her laugh. 

I walk out of the cafe, stepping into my 
future, my unseen and unknowable future, 
without him.


Deborah Walker grew up in the most 
English town in the country, but she soon 
high-tailed it down to London where she 
now lives with her partner and two young 
children. 

Destiny from density


The identification of a signalling protein that regulates the accumulation of fat and connective tissue in breasts may help to 
explain why high mammographic density is linked to breast-cancer risk. It may also provide a marker for predicting this risk. 


VICTORIA L. SEEWALDT 


Our ability to evaluate the lifetime risk
of breast cancer in women who have
a family history of breast and ovarian
cancer was revolutionized by the identification
of the breast-cancer-associated genes
BRCA1 and BRCA2. But although this was a
landmark discovery, it has not translated into
improvements in the assessment of breast-cancer
risk in the general population, because
more than 95% of women who develop breast
cancer do not have a BRCA-gene mutation.
So the search for biomarkers to predict the 
risk in these women remains a goal for many 
biologists, clinicians and epidemiologists. 
Over the past decade, increased mammo- 
graphic density — thought to result primarily 
from an increased proportion of breast collagen 
content relative to fat-tissue content — has
emerged as a non-genetic marker of breast-cancer
risk. However, the biological mechanism
that relates tissue density to tumour development
is poorly defined. Writing in Cancer
Discovery, DeFilippis et al. now provide a link
between mammographic density and cellular 
signalling during the initiation of breast cancer. 
The human breast is composed of many cell 
types, including connective-tissue (stromal) 
cells, fat cells (adipocytes), cells that make up 
blood vessels (endothelial cells) and immune 
cells. Collectively, these cells make up the 
microenvironment of the breast. Over the
past five years, it has been increasingly recog- 
nized that this microenvironment, including 
the relative proportions of different types of 
cell and therefore the tissue density, may have 
a major role in the initiation and progression 
of breast cancer. One indication of this role
is the epidemiological link between mammo- 
graphic density and cancer risk: Caucasian 
women with high mammographic density 
have a two- to sixfold increase in lifetime risk 
of developing breast cancer. Another indication
is the physical change in the breast microenvironment
that is seen in the initial stages
of breast-tumour formation. This is described 
as a desmoplastic reaction, which is charac- 
terized by the recruitment and activation of 
stromal cells and results in increased colla- 
gen deposition in the breast — and therefore 
increased mammographic density.


[Figure 1 labels: adipose cell; low mammographic density (high CD36); high mammographic density (low CD36).]


Figure 1 | Regulating the breast microenvironment. The main tissue types in human breasts are
adipose (fat) tissue and stromal (connective) tissue, which contains collagen. The ratio of fat to collagen
determines the mammographic density of the breast, with density increasing as the relative amount of
fat tissue falls. High mammographic density is a known risk factor for breast cancer, and mammographic
density also increases as a breast tumour forms. DeFilippis et al. have found that expression of the cell-signalling
protein CD36 is lower in several types of cell taken from breasts that have high mammographic
density, whether or not cancer is present. They also show that a reduction in CD36 expression leads to
increased collagen deposition and decreased fat accumulation in the breast.


The increased density is measured by a 
general increase in X-ray absorbance and is 
therefore relatively inexpensive and easy to 
measure. But we have little understanding of 
what determines increased density and how 
this is linked to increased cancer risk and the 
initiation of tumour formation. The situation
is complicated by the complex relationships 
between mammographic density, breast- 
cancer risk, ageing, obesity and race. For exam- 
ple, mammographic density typically declines 
with age, but, because breast cancer is an age-related
disease, this means that, paradoxically,
a woman's breast-cancer risk will increase with 
age while her mammographic density declines. 

DeFilippis and colleagues have helped to 
unravel this story by revealing that expres- 
sion of a single molecule, CD36, is necessary 
and sufficient to regulate two key features 
of mammographic density: the number of
adipocytes in the breast and the amount of 
collagen deposited in the breast tissue. The 
authors show that expression of CD36 is 
lower in breast-tissue cells of women with high 
mammographic density — in women with or 
without breast cancer — than in cells from 
breasts with normal density (Fig. 1). CD36 is 
an integral membrane protein found in many 
cell types, and it is already known to regulate 
several basic cellular processes governing how 
cells interact and grow, including the differen- 
tiation of adipocytes, the growth of new blood 
vessels (angiogenesis) and immune signalling.
Now it seems that an influence on mammographic
density can be added to this list.
To establish how CD36 expression exerts 
its effects on breast-tissue density, the authors 
modulated CD36 expression in vitro in cells
taken from healthy human breast tissue. They
found that reducing CD36 expression led to a 
decrease in the total number of fat cells and an 
increase in the amount of collagen deposited 
in the extracellular matrix around the cells; 
higher CD36 expression increased the num- 
ber of fat cells (Fig. 1). The researchers also 
observed lower fat accumulation and greater 
collagen accumulation in the mammary glands 
of mice lacking the Cd36 gene. These findings 
indicate that inhibition of this protein is suf- 
ficient to recapitulate some of the key events 
that lead to higher mammographic density and 
initiation of breast cancer. 

Not only do these findings provide a mech- 
anistic link between cell-signalling pathways 
and mammographic density, they are also a 
convincing indication that loss of CD36 may 
be an early event in breast carcinogenesis. 
Although mammographic density is used as
a surrogate marker of breast-cancer risk in the
general population, it is an imprecise indicator. 
It gives little insight into the biology of why a 
woman might develop breast cancer, and it is 
confounded by paradoxes, such as the increase 
in density but decrease in risk if a woman exer- 
cises, or the decrease in density but increase in 
risk as a woman ages. By demonstrating that 
CD36 links the biology of breast density with 
the biology of breast-cancer initiation, DeFilippis
and colleagues have provided information
that may help to make density a more useful 
marker of risk. Finally, their results also raise 
the possibility that CD36 could be pharma- 
cologically targeted in attempts to prevent 
breast cancer in the majority of women who 
(fortunately) lack a BRCA mutation.


Victoria L. Seewaldt is in the Department 
of Medicine, Duke University, Durham, 


Signs of instability 


The finding that pools of gas hydrates — compounds that trap natural gas 
emissions — in ocean sediments are deeper than expected implies that the hydrates 
are destabilizing, and might release gigatonnes of methane. SEE LETTER P.527 


JUERGEN MIENERT 


Methane hydrate is an ice-like
substance that exists at low temperatures
and high pressures, and which
fills the pore space of sediments in the seabed
and sub-seabed. Increases in sea temperature 
could destabilize methane hydrate, releas- 
ing methane gas into the ocean and possibly 
into the atmosphere — where, if released in 
sufficient quantities, it would act as a potent 
greenhouse gas and thus contribute to global 
warming. On page 527 of this issue, Phrampus 
and Hornbach report a combination of seismic
data and thermal models that suggests that
changes in the Gulf Stream are rapidly destabi- 
lizing methane hydrate along the North Amer- 
ican continental margin — the zone of ocean 
floor where the deep ocean meets the shallow 
continental shelves, and which marks the tran- 
sition between continental and oceanic crust. 

Methane hydrates are found within a region 
of sediments known as the gas-hydrate stability 
zone (GHSZ). The depth of the GHSZ varies 
mainly with pressure (which depends on water 
depth); with the composition of the natural 
gas trapped in the hydrates (which depends 
on whether the gas is produced by microbes 
at shallow depths or by thermal processes at 
greater depths); with the temperature gradient 
beneath the seabed; and with the water temperature
at the seabed. If the deep gas reservoirs
in continental margins are leaking, they
may contribute methane to the shallower gas-hydrate
reservoir system.

One of the most pressing issues in climate 
change is the relationship between ocean 
warming and methane escape from the seabed
(Fig. 1). The extent to which such
methane venting is connected to the thawing
of methane-hydrate reservoirs is the crux of 
this issue, but also of two big, related stories 
involving methane in the environment: the 
large-scale collapse of continental slopes 
(steep inclines at the edge of continental 
shelves) and the increasing methane levels 
in the atmosphere. For example, some areas 
of the ocean margins along the east coast of 
the United States and the Norwegian Arctic
have cracked, and seem to be susceptible to 
future slope failures. This, in turn, may enable 
mechanisms to develop that allow the rapid 
transfer of methane from sub-seabed gas- 
hydrate deposits into the upper ocean and the 
atmosphere, which would add to greenhouse-gas
emissions. Such destabilization in the
methane-hydrate reservoirs and continental
slope may occur at intermediate depths
along upper continental margins, down to
approximately 1,000 metres below sea level.

Figure 1 | Bubbling up. Hydrocarbon gas, including methane, seeps from sub-seabed reservoirs and
forms plumes of bubbles. The graphic shows two plumes in the Gulf of Mexico as an east-west trace,
based on echo-sounding data. Phrampus and Hornbach report that changes in the Gulf Stream could
trigger the release of gigatonnes of methane from ocean sediments off the east coast of America. (Figure
adapted from ref. 10.)

Phrampus and Hornbach’s paper adds to the 
scientific debate concerning the risk of sub- 
stantial, rapid methane emissions in the future, 
and will certainly draw considerable attention 
to the cracked continental slope of the east 
coast of North America. The authors’ seis- 
mic data reveal that the GHSZ in this region 
is deeper than predicted from models based 
on current ocean temperatures, indicating 
that transient processes for methane-hydrate 
melting are occurring in the sub-seabed. The 
researchers rule out several factors that could 
explain this observation, such as changes of sea 
level, increases in the contribution of thermo- 
genic gas from deep hydrocarbon reservoirs, 
increased sedimentation rates and lower heat 
flow in the sub-seabed. Instead, they conclude 
that the discrepancy was probably caused 
solely by warming of the ocean at intermediate 
depths over the past few thousand years, the 
heat from which has not yet fully penetrated 
the seabed, and so has yet to affect the depth 
of the GHSZ. 

Phrampus and Hornbach argue that 
the required warming can be explained by 
changes in the temperature or path of the 
Gulf Stream within the past 5,000 years or so. 
Extrapolating from their data, they estimate 
that these changes will ultimately warm the 
western North Atlantic margin by as much 
as 8°C, and will trigger the destabilization of 
2.5 gigatonnes of methane hydrate. What's 
more, taking into account changes observed 
in the Arctic environment that could also
cause methane-hydrate destabilization, the 
authors suggest that this quantity may repre- 
sent only a fraction of the methane hydrate that 
is currently destabilizing around the world. 
However, the extent of global destabilization 
is difficult to determine exactly, because esti- 
mates of the temperatures of ocean water at 
intermediate depths on glacial–interglacial
timescales of several thousand years suggest
that ocean warming may be different between
oceans.

Whether the destabilization of gigatonnes of 
methane hydrate from the east coast of North 
America would adversely affect climate in the 
future depends on how rapid temperature 
changes associated with the Gulf Stream will 
affect approximately 10,000 square kilometres 
of the US eastern continental margin, an area 
prone to submarine landslides. More broadly, 
the big unknowns regarding this century’s 
ocean-temperature shifts are to what extent, 
and how rapidly, such shifts will reduce the 
stability of methane hydrate in ocean mar- 
gins. It may (hopefully) turn out to be the case 
that intermittent increases in ocean tempera- 
ture along methane-hydrated regions are not 
enough to increase ocean acidification and — 
if methane is released from the ocean — the
level of carbon in the atmosphere.

Phrampus and Hornbach’s GHSZ mapping 
may provide a timely lesson about how the 
conditions of the sea floor down to a depth of 
about 1,000 metres below sea level can change 
from favouring methane-hydrate formation to 
favouring methane-hydrate melting. It shows 
that we must use seismic records as indicators 
of past conditions affecting ocean methane 
hydrates, and also suggests that real-time, 
long-term recording of anticipated sea-floor 
releases of gigatonnes of methane is needed
to provide the ultimate proof that a warming
Gulf Stream is causing methane-hydrate
melting.

Snapshot of an activated
peptide receptor 


Developing therapeutic drugs that target peptide receptors is challenging. 
The structure of one of these G-protein-coupled receptors, NTS1, activated
and bound to a peptide, provides an excellent starting point. SEE ARTICLE P.508 


FELIX HAUSCH & FLORIAN HOLSBOER 


Some peptides act as chemical signals
between cells, controlling mood, behaviour,
stress responses, blood pressure,
digestion and cancer progression. They typically
interact with receptor proteins at the cell
surface — for example, with the G-protein- 
coupled receptors (GPCRs) that pass the 
message on to G proteins inside the cells to 
induce specific cellular responses. Pharma- 
cological activation of peptide-recognizing 
GPCRs could thus enable improved therapies 
to be produced for diseases that involve these 
receptors. Unfortunately, drug development 
for these receptors is notoriously difficult, in 
part owing to the lack of structural informa- 
tion about how the peptide binds or how the 
receptor is activated. On page 508 of this issue, 
White et al. present the structure of a typical
peptide-interacting GPCR (called neuroten- 
sin receptor 1), bound to an activating pep- 
tide. This work gives the first insight into the 
mechanisms by which this class of receptor is 
turned on and will substantially facilitate the 
design of candidate drugs*. 

One of the difficulties associated with the 
discovery of drugs that act on peptide-rec- 
ognizing GPCRs is that peptides often bind 
to their target proteins through multiple
hydrogen bonds or electrostatic interactions
— something that is difficult to replicate in
synthetic molecules while maintaining their
drug-like properties. Moreover, unlike cytoplasmic
proteins, GPCRs are inserted in the
cell membrane and are therefore unstable in
the aqueous solutions used for the crystallographic
techniques that are commonly needed
for structure-based drug design.

*This article and the paper under discussion were
published online on 10 October 2012.

White and colleagues’ structure of the 
peptide-activated neurotensin receptor 1 
(NTS1) is the latest result of a technological 
revolution that GPCR biochemistry has witnessed
in recent years. It was made possible
by several methodological breakthroughs, 
such as the engineering of GPCRs to create 
receptors that are more stable than their natu- 
ral counterparts (by changing specific amino 
acids or by fusion with other stabilizing pro- 
teins), as well as the development of specific 
conditions for crystallizing membrane pro- 
teins (lipidic cubic phase methods). 

The overall protein architecture of NTS1 is 
expected to be representative of the large and, 
so far, structurally unexplored β-subgroup of
GPCRs. More importantly, the structure clari- 
fies how peptide agonists (that is, activators) 
interact with GPCRs. Until now, peptide ago- 
nists had been thought to bind to the extra- 
cellular side of their GPCRs, but any further 
details were unclear. White et al. show that 
a neurotensin analogue (a peptide similar
to neurotensin, NTS1’s natural substrate)
penetrates deeply into the interior of the receptor
like a nail (Fig. 1), albeit not as deeply as
non-peptidic agonists do in the GPCRs to
which they bind.

Figure 1 | Peptides digging deep. White et al.
elucidated the structure of the NTS1 protein
(a G-protein-coupled receptor that is inserted
in the cell membrane) bound to an activating
peptide. The protein's backbone is shown in dark
brown, whereas its surface is indicated as a grey
transparent envelope. Arginine 327 is a key residue
for activation of the receptor. Some amino-acid
residues (from arginine 328 to glutamate 337)
have been omitted to allow a better view of the
peptide. Carbon, oxygen and nitrogen atoms are
green, red and blue, respectively, in the peptide.
The cytoplasmic part of the receptor passes the
activation signal to G proteins, which in turn
trigger cellular responses.

Because agonist binding activates GPCRs by 
inducing changes in the receptors’ conforma- 
tion, a credible structure of an activated state 
is essential for the design of drug-like agonists. 
This seems to have been achieved for NTS1, as 
judged on the basis of previous biochemical 
studies and plausible comparisons with other 
activated GPCRs. However, a touch of ambigu- 
ity in the NTS1 structure is the poor resolution 
of the carboxy-terminal end of the bound pep- 
tide, which is precisely the part of the agonist 
that seems to be most important for activation 
of the receptor. 

How does neurotensin turn its receptor on? 
The authors observe several intriguing inter- 
actions between amino-acid residues at the 
receptor’s cytoplasmic face; such interactions 
might have a role in the activation mechanism. 
This hypothesis can now be tested straightfor- 
wardly by altering those residues by making 
specific mutations in the NTS1-encoding gene. 
Although the structure clearly shows how 
NTS1 binds to the peptide on the receptor’s 
extracellular side, how this is translated into
the observed active-like conformation at the
cytoplasmic face is less clear. It is possible that 
neurotensin pulls on certain residues on NTS1 
(such as arginine 327, see Fig. 1), as happens 
during the activation of another GPCR, the 
β2-adrenergic receptor. However, the full
details of NTS1 activation might be revealed 
only by a direct comparison of White and 
colleagues’ structure with an inactive-like 
state — for example, that of NTS1 in com- 
plex with an antagonist, a molecule that locks 
the receptor into an inactive conformation. 
The structure of such an inactive state is now 
eagerly awaited. 

In summary, the presented NTS1 structure 
gives an immediate guideline for the design of 
improved agonists. The neurotensin analogue 
binds in a concave pocket and establishes 
mainly hydrophobic contacts with the recep- 
tor, rather than extensive hydrogen bonds or 
electrostatic interactions — characteristics that 
drug discoverers like to see. If this knowledge 
can indeed be translated into potent NTS1 
agonists (ideally, small non-peptide mol- 
ecules), White and colleagues’ work could be 
the beginning of the end for the ‘poor druggability’
mantra commonly associated with
peptide-interacting GPCRs. And, because of
neurotensin’s role as a neurotransmitter in the
brain, it might eventually lead to new treatment
options for mental disorders such as
schizophrenia.

The inner life of proteins


A quantitative analysis shows that epistasis — the fact that genetic background 
determines whether a mutation is beneficial, deleterious or inconsequential — is 
the main factor regulating evolution at the level of proteins. SEE LETTER P.535 


GUNTER P. WAGNER 


There is a long-standing controversy in
evolutionary biology about the relative
importance of ‘internal’ versus
‘external’ factors in determining the rate and 
outcomes of evolution. Is evolution primarily 
determined by ecology and the environment, 
or do an organism's own functional and devel- 
opmental features make a substantial contri- 
bution? Now, at least at the level of proteins, 
it seems we have an answer. On page 535 of 
this issue, Breen et al. present a rigorous
statistical analysis of protein sequences that 
suggests that functional interactions among 
amino acids are a major determinant of 
protein-sequence evolution. 

Most people are familiar with the principle 
of adaptive evolution, which underlies Charles 
Darwin and Alfred Russel Wallace's theories 
of natural selection. This model describes 
how organisms change in response to biotic 
or abiotic variations in their environment 
— familiar examples include the large body 
size and white fur of the polar bear, where
the body size helps to retain heat and the fur
colour provides camouflage. In fact, responses 
to environmental conditions have been so suc- 
cessful in explaining biological diversity that 
it became tempting to see adaptation as the 
only factor determining evolutionary out- 
comes and patterns. In its extreme, this view 
leads to what philosophers call the function- 
alist interpretation of life — the idea that the 
function of body parts explains their existence 
and structure. 

However, evolutionary biologists know that 
factors internal to organisms also contribute to 
biological diversity. Darwin himself discussed 
the ‘law of correlated growth’, which states that
size changes in various body parts do not occur 
independently of each other but are subject 
to limitations that result from the integrated 
nature of development and growth. In contem- 
porary language, these factors are considered 
constraints that arise from functional or devel- 
opmental interdependencies between parts of 
the organism. 

Fundamentally, these constraints manifest 
themselves as interactions between genes. 
Here, a key concept is epistasis, the term used
to describe the context dependency of muta- 
tion effects — in other words, that the genetic 
background on which a mutation occurs deter- 
mines whether the mutation has any effect and, 
if so, whether it is beneficial or deleterious. A 
frequently observed manifestation of epistasis 
occurs between the sex-determining factors 
(in mammals, the X and Y chromosomes) 
and some disease risk factors. For instance, 
in men the ε3/ε2 genotype at the gene encoding
the protein ApoE leads to an earlier onset
of coronary artery disease, compared to the
ε3/ε3 genotype. But in women, no such effect
is observed. 

In evolutionary theory, however, epista- 
sis has a curious status. One of the pillars 
of population genetics is the ‘fundamental 
theorem’ of natural selection, which says 
that the response to selection, and thus the 
process of adaptation, depends only on the 
context-independent (additive) genetic effects 
that exist in a population; according to this 
theory, although epistasis exists, it is simply 
noise in an otherwise fairly deterministic
process. This hypothesis is widely
misunderstood, even by many population
geneticists, as saying that epistasis is
insignificant. Breen and colleagues’ results
provide a convincing demonstration to the
contrary, showing that
epistasis is the primary factor affecting the
evolution of proteins.

The authors used an ingenious approach to 
test how epistatic interdependencies among 
the amino-acid residues of a protein contrib- 
ute to the rate at which that protein evolves. 
They studied the amino-acid sequences of 
16 proteins for which sequence informa- 
tion was available, in public databases, for at 
least 1,000 species. From this analysis they 
estimated that, on average, each position in 
a protein accepts around eight alternative 
amino-acid residues. They then reasoned 
that, if these alternative amino-acid residues 
are equally acceptable in the protein, regardless 
of the amino-acid composition of the rest of 
the protein — in other words, without epista- 
sis — then the rate of amino-acid evolution 
should be about 60% of the neutral rate (the 
rate that would occur if all amino acids were 
acceptable). 

However, they found instead that the rate 
of protein evolution is only 5% of the neutral 
rate. After excluding several potential sources 
of error that could influence this statistic, they 
conclude that interdependency among different
amino-acid residues in a protein is the
major factor determining its rate of evolution.
This means that, in the vast majority of cases, 
amino-acid substitutions will persist, on an 
evolutionarily relevant timescale, only when 
the ‘correct’ amino acid, or amino acids, are 
present elsewhere in the protein. 

It follows that internal constraints — not 
only internal to the organism but internal to 
each protein — are the dominant factor in 
determining the rate of protein evolution. 
If that is true at the level of individual pro- 
teins, then it is likely also to be true at the 
level of the organism. Thus, Breen and colleagues
have provided convincing evidence
that epistasis should be considered alongside
adaptation as a key player in evolution.

COSMOLOGY

Infrared light from wandering stars


An explanation has been proposed for the observed excess of cosmic light at 
infrared wavelengths. It invokes stars that are cast into the dark-matter haloes 
of their parent galaxies during powerful galaxy collisions. SEE LETTER P.514 


ANDREA FERRARA 


Ever since the collective infrared light
from cosmic sources was found to
exceed the expected emissions from
known galaxies, researchers have considered
whether the excess might comprise
radiation from distant stars and galaxies
too faint to be detected individually. However,
on page 514 of this issue, Cooray et al.
suggest instead that the excess signal could
be provided largely by nearby stars that were
stripped from the main body of their parent
galaxies during collisions.

According to the standard Big Bang model, 
cosmic structures originated from tiny lumps 
of unseen dark matter in the early Universe 
that grew large enough to collect the normal 
(baryonic) matter from which stars eventually
formed. On theoretical grounds, it is
believed that the first stars were 10-100 times 
more massive than the Sun, because their 
parent gas clumps would have been poor in 
metals (elements other than hydrogen and 
helium), enabling them to avoid fragmentation 
into smaller clumps. 


Figure 1 | Galaxy collision. Cooray et al. propose that the observed large-scale excess of cosmic infrared
background radiation could be produced by stars that were thrown into the outer reaches of their parent 
galaxies during galaxy collisions such as the one shown here. 
Searching for the first stars and, more
generally, for the first generations of galax- 
ies that might have hosted them, is a primary 
function for many current and future tele- 
scopes. Knowledge of the first stars would 
shed light on our origins, and allow research- 
ers to refine the Big Bang model. Over the 
past decade, we have witnessed remarkable 
progress in the discovery of ever more dis- 
tant galaxies, with the record-holder updated 
almost monthly. Deep images from the Hubble
Space Telescope’s infrared camera WFC3,
which has exquisite sensitivity and a relatively
wide field of view, routinely reveal
candidate galaxies at early cosmic epochs. 
And, whenever possible, these candidates are 
subsequently confirmed using spectroscopic 
observations. 

However, there is evidence to suggest that 
the observed early-epoch galaxies alone could 
not have provided the radiation that was 
required to ‘reionize’ hydrogen atoms, an event 
that was complete by about 1 billion years after 
the Big Bang. It is therefore possible that
astronomers are observing only the most lumi- 
nous objects in an underlying population of 
faint galaxies. 

How can this hypothesis be tested? This is 
where the collective infrared light from cosmic 
sources, or the cosmic infrared background 
(CIB), comes in. Measurements of the CIB 
intensity, or more precisely its fluctuations, 
could potentially be used to detect the com- 
bined emissions from a population of faint gal- 
axies. Even after removal of the infrared light 
from known galaxies, the CIB intensity is not 
perfectly uniform across the sky: it shows fluc- 
tuations in different directions of about 10% 
around the mean, indicating that the under- 
lying sources of the signal gather into clusters. 
The degree to which these sources cluster as a 
function of their angular separation on the sky 
is encoded in a mathematical quantity known 
as the angular power spectrum. However, the 
CIB intensity is contaminated by sunlight 
reflected by local interplanetary dust in the 
Solar System — the zodiacal light. The detec- 
tion of a faint-galaxy population from CIB 
fluctuation measurements is therefore depend- 
ent on the accurate subtraction of this light. 

Previous high-resolution infrared images 
taken with the Spitzer Space Telescope and
the AKARI satellite have made it possible
to measure such fluctuations and to subtract 
the contribution of the zodiacal light reliably. 
These observations have revealed that the 
angular power spectrum, and thus the cluster- 
ing strength, increases towards large angular 
distances on the sky, up to one degree. This 
increase cannot be accounted for solely by the 
light produced by known galaxies.

In their study, Cooray et al. analyse recent 
deep images from Spitzer and detect the 
same increase in clustering strength. What's 
more, they demonstrate that the increase 
cannot be explained by either of two existing
hypotheses: one based on a contribution from
a population of faint, distant galaxies during
the cosmic ‘reionization’ epoch, and
another that invokes dwarf galaxies at inter- 
mediate distances from Earth. Intriguingly, 
however, the shape of the power spectrum is 
consistent with the distant-galaxy hypothesis. 

What, then, is producing these large-scale 
CIB fluctuations? Cooray and colleagues pro- 
pose that the source of this signal is light from 
intrahalo stars of known galaxies — that is, 
stars that have been stripped from the main 
body of their parent galaxies and cast into the 
galaxies’ dark-matter haloes during galaxy 
collisions (Fig. 1). As interesting and plausi- 
ble as it is, this explanation is based heavily on 
the poorly understood abundance and spec- 
tral energy distribution of intrahalo stars. In 
contrast to the distant-galaxy hypothesis, this 
model also predicts fluctuations induced by 
intrahalo stars in the visible part of the electro- 
magnetic spectrum, in which the light from 
the first galaxies is blanketed by intervening 
intergalactic neutral hydrogen. 

It will be interesting to see whether the 
authors’ proposal stands up to scrutiny. How- 
ever, the most exciting endeavour will be to 
isolate the CIB signal produced by the faint, 
reionizing galaxies and thereby make them 
amenable to study. Because this signal is buried 
under the putative signal of the intrahalo stars, 
it will be necessary to accurately remove this 
‘foreground’ before that goal can be attained.

50 Years Ago


Nuclear explosions in space produce 
negligible effects on the Earth, for 
example, a distinctly decreased 
intensity of fall-out; they produce 
more significant perturbations of the 
upper atmosphere, but these are very 
temporary (measured in hours). 

It is in near-space itself where very 
noticeable effects can be produced 
which can last for many months 
because of the presence of the Earth’s 
magnetic field. It is evident that 
weapons tests in space can have 
important scientific by-products. 
The most interesting are the 
creation of a thin shell of 2-MeV 
fission electrons ... as well as the 
creation of a thick belt of low-energy
protons from fusion neutrons. It will 
probably be desirable to monitor 
these artificial radiation belts with 
quite sophisticated energy-resolving 
detectors, to suppress the naturally 
trapped radiation, as well as the 
much larger changes (of factors of 
10-100 and even higher) produced 
by such natural events as solar flares 
and magnetic storms. 

From Nature 27 October 1962 


100 Years Ago 


American naturalists are delighted 
by the announcement that Mrs. 
Russell Sage, widow of the late 
well-known Wall Street financier, 
has purchased Marsh Island, off the 
coast of Louisiana, in order to make 
it a perpetual bird sanctuary ... The 
island in question is about
75,000 acres in extent. It is a sylvan
labyrinth affording shelter and
food to hundreds of thousands
of wild birds which resort to it in
winter when the northern lakes and 
streams are locked fast under ice. 
For this reason it has been the great 
killing ground for the markets of 
New Orleans, St. Louis, Cincinnati, 
and Chicago, no fewer than seventy 
market gunners being regularly 
employed there every winter. 

From Nature 24 October 1912 

A map of rice genome variation reveals the origin of cultivated rice


Xuehui Huang, Nori Kurata, Xinghua Wei, Zi-Xuan Wang, Ahong Wang, Qiang Zhao, Yan Zhao, Kunyan Liu,
Hengyun Lu, Wenjun Li, Yunli Guo, Yiqi Lu, Congcong Zhou, Danlin Fan, Qijun Weng, Chuanrang Zhu, Tao Huang,
Lei Zhang, Yongchun Wang, Lei Feng, Hiroyasu Furuumi, Takahiko Kubo, Toshie Miyabayashi, Xiaoping Yuan, Qun Xu,
Guojun Dong, Qilin Zhan, Canyang Li, Asao Fujiyama, Atsushi Toyoda, Tingting Lu, Qi Feng, Qian Qian, Jiayang Li
& Bin Han


Crop domestications are long-term selection experiments that have greatly advanced human civilization. The 
domestication of cultivated rice (Oryza sativa L.) ranks as one of the most important developments in history. 
However, its origins and domestication processes are controversial and have long been debated. Here we generate 
genome sequences from 446 geographically diverse accessions of the wild rice species Oryza rufipogon, the 
immediate ancestral progenitor of cultivated rice, and from 1,083 cultivated indica and japonica varieties to construct 
a comprehensive map of rice genome variation. In the search for signatures of selection, we identify 55 selective sweeps 
that have occurred during domestication. In-depth analyses of the domestication sweeps and genome-wide patterns 
reveal that Oryza sativa japonica rice was first domesticated from a specific population of O. rufipogon around the 
middle area of the Pearl River in southern China, and that Oryza sativa indica rice was subsequently developed from 
crosses between japonica rice and local wild rice as the initial cultivars spread into South East and South Asia. The 
domestication-associated traits are analysed through high-resolution genetic mapping. This study provides an
important resource for rice breeding and an effective genomics approach for crop domestication research.


Cultivated rice (Oryza sativa L.), which is grown worldwide and is one 
of the most important cereals for human nutrition, is considered to 
have been domesticated from wild rice (Oryza rufipogon) thousands 
of years ago. The differences between O. sativa and O. rufipogon are
reflected in a wide range of morphological and physiological traits.
Despite the fact that rice is a major cereal and a model system for plant 
biology, the evolutionary origins and domestication processes of culti- 
vated rice have long been debated. The puzzles about rice domestication 
include: (1) where the geographic origin of cultivated rice was, (2) which 
types of O. rufipogon served as its direct wild progenitor, and (3) whether 
the two subspecies of cultivated rice, indica and japonica, are derived 
from a single or multiple domestications. 

A wide range of genetic and archaeological studies have been carried 
out to examine the phylogenetic relationships of rice, and investigate 
the demographic history of rice domestication. Molecular phylogenetic
analyses indicated that indica and japonica originated independently.
However, the well-characterized domestication genes
in rice were found to be fixed in both subspecies with the same alleles,
thus supporting a single domestication origin. Recently, a demographic
analysis of single-nucleotide polymorphisms (SNPs) detected
from 630 gene fragments suggested a single domestication origin of
rice. Meanwhile, population genetics analyses of genome-wide data
of cultivated and wild rice have tended to suggest that indica and
japonica genomes generally appear to be of independent origin,
but many genomic segments bearing domestication alleles may have
originated only once. Despite these advances, wider sampling with
population-scale whole-genome sequencing is needed to shed greater 
light on the evolutionary history of rice domestication. An in-depth
investigation of the haplotype structure near the domestication sites
will be critical for evaluating the direction of introgression. The specific 
ancestral population and the subsequent demographic event are yet to 
be identified. 

Moreover, a comprehensive map of rice genome variation will facil- 
itate genetic mapping of complex traits in rice. We recently collected 
diverse rice cultivars for sequencing, and carried out genome-wide 
association studies (GWAS) for many agronomic traits in cultivated 
rice. Here we sequenced and analysed the genomes of 446 O. rufipogon
accessions to investigate the phylogenetic relationships between
cultivated and wild rice and identify the signatures of selection in rice 
domestication. This research also provides a robust foundation that 
will enable rice breeders to effectively exploit diverse genetic resources 
for rice improvement. 


Analysis of wild rice populations 

The strategy of this study is briefly described in Supplementary Fig. 1. 
The genus Oryza consists of 23 species, and the wild rice O. rufipogon
is believed to be the immediate progenitor of the cultivated rice O. sativa 
(Supplementary Fig. 2 and Supplementary Table 1). From large collec- 
tions of wild rice germplasm maintained in China and Japan, we selected 
446 diverse O. rufipogon accessions including both perennial and 
annual (also called Oryza nivara) forms from Asia and Oceania,
spanning the native geographic range of the species (Supplemen- 
tary Tables 2 and 3). We sequenced these accessions with twofold 
genome coverage. After aligning the reads against the rice reference 
genome sequence, we identified a total of 5,037,497 non-singleton 
SNPs. Based on the SNP data, the sequence diversity (π) of O. rufipogon
was estimated at ~0.003, which is higher than that in O. sativa: the
sequence diversity is 0.0024 for O. sativa, and 0.0016 and 0.0006 for
indica and japonica, respectively. Approximately 82% of SNPs (minor
allele frequency > 0.05) segregating in O. rufipogon also segregate in 
O. sativa (Supplementary Table 4). This observation is consistent with 
previous suggestions that part of the genetic diversity in the progenitor 
would be lost because only a limited number of individuals were used 
during domestication.
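
As an illustration of the diversity statistic quoted above, the following is a minimal sketch of how per-site nucleotide diversity (π) can be computed from biallelic SNP allele counts. It is not the pipeline used in this study: the function name, the toy counts and the window length are hypothetical, and the sketch assumes every site is called in every sampled chromosome, which low-coverage data would not guarantee.

```python
def nucleotide_diversity(alt_counts, n_chromosomes, sequence_length):
    """Per-site nucleotide diversity (pi) from biallelic SNP allele counts.

    alt_counts: number of chromosomes carrying the alternative allele at each SNP.
    n_chromosomes: number of sampled chromosomes (assumed identical at every site).
    sequence_length: number of sites surveyed, including invariant ones.
    """
    if n_chromosomes < 2:
        raise ValueError("need at least two sampled chromosomes")
    if sequence_length <= 0:
        raise ValueError("sequence_length must be positive")
    n = n_chromosomes
    total = 0.0
    for k in alt_counts:
        p = k / n
        # Expected heterozygosity at this site, with the n/(n-1) correction
        # that makes it equal to the mean number of pairwise differences.
        total += 2.0 * p * (1.0 - p) * n / (n - 1)
    return total / sequence_length


# Hypothetical toy data: three SNPs in a 1,000-bp window, 10 sampled chromosomes.
pi_example = nucleotide_diversity([5, 2, 1], 10, 1000)
print(f"pi = {pi_example:.6f}")
```

Comparing this quantity between a wild and a cultivated panel over the same region gives the kind of diversity contrast discussed in the text.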

We investigated the population structure of the O. rufipogon acces- 
sions. On the basis of the neighbour-joining tree, as well as principal- 
component analysis (PCA), we classified the O. rufipogon species into 
three types, simply designated as Or-I, Or-II and Or-III in this study
(Fig. 1a and Supplementary Fig. 3). We found that the rice population
structure strongly correlated with geographic distribution (r² = 0.2
between the first principal component and the longitude, and
r² = 0.3 between the second principal component and the latitude).
Interestingly, the O. rufipogon accessions sampled from southern
China mostly belong to the Or-III type (Fig. 1b and Supplementary
Fig. 4). The level of population differentiation, FST, was estimated at
0.18 among the groups of O. rufipogon, which is much lower than that
of O. sativa (~0.55 on average). The differentiation was not evenly
distributed across the rice genome (Supplementary Fig. 5). We
scanned the whole genome for highly differentiated loci and found
68 loci with FST > 0.3, which covered ~3% of the complete rice genome
(Supplementary Table 5). Of these loci, we found several known genes
or quantitative trait loci (QTLs) that included DPL2 (hybrid incompatibility)
(Fig. 1c), OsSOC1 and Ghd7 (flowering time) and qCTS12
(cold tolerance), all of which have been reported to be closely related to
indica-japonica differentiation in rice. Hence, the highly differentiated 
loci can provide important clues for searching the genes involved in 
local adaptation. 
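
The differentiation scan described above can be sketched with a simple heterozygosity-based FST. This is only an assumption-laden illustration: the estimator shown is Wright's (H_T - H_S) / H_T with equally weighted populations, which may differ from the estimator actually used in the study, and the locus names and allele frequencies are invented. Only the 0.3 cutoff is taken from the text.

```python
def wright_fst(freqs):
    """F_ST = (H_T - H_S) / H_T for one biallelic locus.

    freqs: allele frequency of the same allele in each population,
    with all populations weighted equally (a simplifying assumption).
    """
    if not freqs:
        raise ValueError("need at least one population")
    p_bar = sum(freqs) / len(freqs)
    h_total = 2.0 * p_bar * (1.0 - p_bar)            # heterozygosity of the pooled population
    if h_total == 0.0:
        return 0.0                                   # monomorphic locus: no differentiation to measure
    h_within = sum(2.0 * p * (1.0 - p) for p in freqs) / len(freqs)
    return (h_total - h_within) / h_total


# Hypothetical allele frequencies in three groups for three loci.
loci = {
    "locus_A": [0.10, 0.50, 0.90],
    "locus_B": [0.45, 0.50, 0.55],
    "locus_C": [0.05, 0.10, 0.95],
}
# Flag loci above the FST > 0.3 cutoff mentioned in the text.
highly_differentiated = [name for name, f in loci.items() if wright_fst(f) > 0.3]
print(highly_differentiated)
```

In a genome scan the same calculation would be averaged over SNPs in windows rather than applied to single loci.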
Figure 1 | Genetic structure and association analysis in the wild rice
population. a, Neighbour-joining tree of 446 O. rufipogon accessions, which
was calculated from ~5 million SNPs, identifies the three groups of Or-I (red),
Or-II (grey) and Or-III (blue). b, Geographic origins of wild rice accessions.
c, The level of genetic differentiation (FST) in O. rufipogon population around
the DPL2 gene that underlies indica-japonica hybrid incompatibility in rice.
d, Regional Manhattan plots of GWAS for tiller angle in O. rufipogon
population identify a known gene, PROG1, using a compressed mixed linear
model. The genome-wide significance threshold (1 X 10° °) and the position of 
the peak SNP are indicated by a horizontal dash-dot line and a vertical red line, 
respectively. 
Because O. rufipogon is an out-crossing species, it was expected to
have a relatively high decay rate of linkage disequilibrium. We found
that linkage disequilibrium in O. rufipogon, expressed
as r², dropped to half of its maximum value at ~20 kilobases (kb) on
average (Supplementary Fig. 6), which is much more rapid than that in
O. sativa (~123 kb and ~167 kb in indica and japonica, respectively).
To perform GWAS in O. rufipogon, we used the k-nearest neighbour 
algorithm for data imputation (Supplementary Table 6), and pheno- 
typed the O. rufipogon population for two traits, leaf sheath colour and 
tiller angle. The strongest associations for sheath colour and tiller angle 
were found to be just around the known loci OsC1, for colouration, and
PROG1, for prostrate growth (Fig. 1d and Supplementary Figs 7-9).
Through computational simulations we predicted that the mapping 
resolution of GWAS in O. rufipogon was approximately three times 
greater than that in O. sativa on average (Supplementary Figs 10-12). 
Hence, the wild rice population and accompanying comprehensive 
sequence resource should be of great utility for directly dissecting 
agronomic traits in rice. 
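
The linkage-disequilibrium measure used above can be illustrated with a short sketch. It assumes phased haplotypes coded as 0/1, which is a simplification relative to low-coverage sequence data, and both function names and all numbers in the toy decay curve are hypothetical. The first function computes r² between two SNPs; the second reports the distance at which mean r² first falls to half of its maximum, the summary quoted in the text.

```python
def ld_r2(hap_a, hap_b):
    """Squared correlation (r^2) between two biallelic SNPs from phased 0/1 haplotypes."""
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("haplotype lists must be non-empty and of equal length")
    p_a = sum(hap_a) / n
    p_b = sum(hap_b) / n
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denom == 0.0:
        return 0.0  # r^2 is undefined for a monomorphic site; 0 is returned by convention here
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium, D
    return d * d / denom


def half_decay_distance(decay_curve):
    """First distance at which mean r^2 is at most half of its maximum.

    decay_curve: iterable of (distance, mean_r2) pairs. Returns None if
    r^2 never falls to half of its maximum within the distances supplied.
    """
    points = sorted(decay_curve)
    half_max = max(r2 for _, r2 in points) / 2.0
    for distance, r2 in points:
        if r2 <= half_max:
            return distance
    return None


# Hypothetical decay curve: (distance in kb, mean r^2 at that distance).
curve = [(1, 0.8), (10, 0.5), (20, 0.4), (50, 0.2)]
print(half_decay_distance(curve))
```

A shorter half-decay distance means that associated SNPs lie closer to the causal variant, which is why faster decay translates into finer mapping resolution.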


Phylogenetic relationships of rice 


We used whole-genome sequencing data for a large panel of acces- 
sions containing 446 O. rufipogon accessions and 1,083 O. sativa 
varieties to explore their phylogenetic relationships. The 1,083 diverse 
O. sativa varieties were collected throughout the world and sequenced 
with one-fold genome coverage (Supplementary Table 7). Of these, 
950 genomes had previously been reported, and the remaining 133
genomes, including many representative varieties (for example, aro- 
matic rice), were sequenced and first reported in this study. A total of 
7,970,359 non-singleton SNPs were identified from the 1,529 genome 
sequences. To determine the ancestral states of the SNPs, the close 
relatives of O. rufipogon and O. sativa were also sequenced with a total 
of approximately 50-fold genome coverage (Supplementary Table 8). 
Using genome sequences of these outgroups, we were able to identify the 
ancestral states of 6,119,311 SNPs out of the approximately 8 million 
SNPs (77%). 

We used the genotype data set of the ~8 million SNP sites from 
1,529 genomes to infer genome-wide relationships. Both the phylo- 
genetic tree and the PCA plots indicate that O. sativa indica and 
japonica are descended from Or-I and Or-III, respectively (Fig. 2a
and Supplementary Fig. 13). We then investigated the detailed 
relationship between indica and Or-I and that between japonica 
and Or-III separately (Supplementary Figs 14-16). The level of genetic 
differentiation between indica and Or-I was modest (FST = 0.17), and
indica contains approximately 75% of the genetic diversity in Or-I 
(Fig. 2b). A small number of indica and Or-I accessions seemed to 
be intermediate between cultivated and wild rice (Fig. 2a). In contrast, 
the japonica groups (temperate japonica, tropical japonica and aromatic) 
were all clustered together, and had descended from wild rice in southern 
China (sub-clade Or-IIIa). There was an obvious genetic distinction 
between japonica and Or-IIIa (Fig. 2a), which had a relatively high level 
of population differentiation (FST = 0.36). We found that only approximately
33% of the genetic diversity of Or-III persisted in japonica
(Fig. 2b). The strong genetic bottleneck indicates that a small effective 
population from Or-III was used for domesticating japonica cultivars. 

On the basis of our SNP data, most sequence variants observed 
between indica and japonica, which were estimated to have diverged 
hundreds of thousands of years ago, already existed in the progenitor
gene pools. However, the differentiation was enhanced during domestication,
with FST expanding from 0.18 in O. rufipogon to 0.55 in O. sativa
(Fig. 2b). The high level of genetic differentiation between indica and 
japonica has resulted from a combination of the modest differentiation 
present within their ancestor population and the recent domestication 
bottleneck (Fig. 2c). For example, at the causal SNP site for GS3 
(a major QTL for grain shape), the null allele mainly existed in
the wild rice populations in the Guangxi and Guangdong provinces 
of southern China. This allele rapidly extended to become the major 
Figure 2 | Genome-wide relationship between cultivated rice and its wild
progenitor. a, Phylogenetic tree of the full population (446 O. rufipogon 
accessions and 1,083 O. sativa varieties) calculated from ~8 million SNPs in O. 
rufipogon and O. sativa. The double-layer rings indicate O. rufipogon (outer 
ring: Or-I, Or-II and Or-III are coloured in red, grey and blue, respectively) and 
O. sativa (inner ring: indica and japonica subspecies are in pink and sky blue, 


allele (~98%) in the japonica population, thus generating a distinct 
phenotypic difference between indica and japonica. We screened all 
SNPs that were highly differentiated in frequency between indica and 
japonica, and found a total of 213,188 indica–japonica-differentiated
SNPs. With regard to the differentiated SNPs, the differences present
within O. rufipogon, the domestication bottleneck in japonica and
that in indica were estimated to contribute ~50%, ~40% and ~10% of total
alterations in allele frequency, respectively (Supplementary Fig. 17).
We examined the ancestral states of indica–japonica-fixed SNPs,
and found that 55% of SNPs of the ancestral alleles are fixed in indica 
and the other 45% are fixed in japonica. The dN/dS ratio (dN, number 
of non-synonymous substitutions per non-synonymous site; dS, 
number of synonymous substitutions per synonymous site) of the 
SNPs was calculated to be 0.34, which was almost equal to the average 
level of the total SNPs. In contrast, for the 9,595 SNPs that were fixed 
between O. rufipogon and O. sativa, the ancestral alleles of 93.3% of the
SNPs are identical to O. rufipogon, indicating that O. rufipogon has
retained more ancestral states than O. sativa, which also further sup- 
ported our conclusion that O. rufipogon is likely to be the ancestral 
progenitor of O. sativa. The dN/dS ratio of the fixed SNPs between O. 
rufipogon and O. sativa is calculated to be 1.04, indicating positive 
selection during domestication (P < 0.001, chi-square test). 
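
The dN/dS ratio defined above can be written out as a minimal sketch. The function name and all counts are hypothetical, and the sketch uses raw proportions of changed sites with no correction for multiple substitutions at a site, so it should be read as an illustration of the definition and not as the method used in this study.

```python
def dn_ds(nonsyn_subs, nonsyn_sites, syn_subs, syn_sites):
    """dN/dS from raw counts, following the definition in the text.

    dN = non-synonymous substitutions per non-synonymous site,
    dS = synonymous substitutions per synonymous site.
    """
    if nonsyn_sites <= 0 or syn_sites <= 0:
        raise ValueError("site counts must be positive")
    if syn_subs == 0:
        raise ValueError("dS is zero, so the ratio is undefined")
    dn = nonsyn_subs / nonsyn_sites
    ds = syn_subs / syn_sites
    return dn / ds


# Hypothetical counts: equal substitution rates per site give a ratio of 1,
# the value expected when non-synonymous changes are effectively neutral.
print(dn_ds(30, 300, 10, 100))
```

Ratios well below 1 point to purifying selection on amino-acid changes, whereas ratios at or above 1 suggest relaxed constraint or positive selection.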
respectively). b, Illustration of genetic diversity and population differentiation
in O. rufipogon and O. sativa. The size of the circles represents the level of
genetic diversity (π) of the groups, and the FST values between the groups are
indicated. ind, indica; jap, japonica. c, The spectrum of allele frequencies at the 
causal polymorphisms of Ghd7, DPL2 and GS3. 


Screens and annotation of domestication loci 


Selective signatures from domestication include a reduction in nucleotide
diversity and altered allele frequency in the domestication loci.
We measured the ratio of the genetic diversity in wild rice to that in
cultivated rice (πw/πc) across the rice genome, and determined the
cutoff on the basis of permutation tests (Supplementary Information 
section 2). We performed whole-genome screening in indica, japonica 
and the full population using the diversity ratios (Fig. 3 and Sup- 
plementary Figs 18-20). In total we identified 60 loci in indica, 62 in 
japonica and 55 in the full population (Supplementary Tables 9-11). 
We noticed that many loci with strong signals of selection were nearly 
identical in both indica and japonica where FST between indica
and japonica was extremely low, indicating that introduction of traits
during domestication has in many cases involved introgression events.
We noted that most well-characterized domestication genes, including
Bh4 (hull colour), PROG1 (tiller angle), sh4 (seed shattering),
qSW5 (grain width) and OsC1 (leaf sheath colour and apiculus
colour), were among the 55 loci detected in the full population
(Fig. 3). Another three well-characterized domestication genes, qSH1
for seed shattering, Waxy for grain quality and Rc for pericarp
colour, which showed strong selection signals in the japonica panel,
were not fully shared in the indica population. 
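
The diversity-ratio screen described above can be sketched as a window scan. The sketch assumes per-window π values have already been computed for the wild and cultivated panels. The fixed threshold of 3 is an illustrative default matching the value given in the Figure 3 legend (in the study the cutoff was derived from permutation tests), while the floor on the cultivated diversity and the merging of adjacent windows are assumptions made here. All names and values are hypothetical.

```python
def sweep_windows(pi_wild, pi_cult, threshold=3.0, min_pi=1e-6):
    """Indices of windows whose wild-to-cultivated diversity ratio exceeds the threshold."""
    if len(pi_wild) != len(pi_cult):
        raise ValueError("both panels must cover the same windows")
    flagged = []
    for i, (pw, pc) in enumerate(zip(pi_wild, pi_cult)):
        # Floor the denominator so windows with no diversity left in
        # cultivated rice do not cause a division by zero.
        ratio = pw / max(pc, min_pi)
        if ratio > threshold:
            flagged.append(i)
    return flagged


def merge_adjacent(indices):
    """Merge runs of consecutive window indices into (start, end) regions."""
    regions = []
    for i in indices:
        if regions and i == regions[-1][1] + 1:
            regions[-1] = (regions[-1][0], i)
        else:
            regions.append((i, i))
    return regions


# Hypothetical per-window diversity values for six consecutive windows.
pi_wild = [0.003, 0.003, 0.003, 0.003, 0.003, 0.003]
pi_cult = [0.002, 0.0005, 0.0004, 0.002, 0.0009, 0.003]
candidate_sweeps = merge_adjacent(sweep_windows(pi_wild, pi_cult))
print(candidate_sweeps)
```

Running the same scan separately on the indica panel, the japonica panel and the full population would yield three lists of candidate regions that can then be compared, as is done in the text.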
Figure 3 | Whole-genome screening and functional annotations of
domestication sweeps. a, Whole-genome screening of domestication sweeps
in the full population of O. rufipogon and O. sativa. The values of πw/πc are
plotted against the position on each chromosome. The horizontal dashed line
indicates the genome-wide threshold of selection signals (πw/πc > 3). b-d, A
large-scale high-resolution mapping for fifteen domestication-related traits was
performed in an O. rufipogon × O. sativa population. The domestication
sweeps that overlap with characterized domestication-related QTLs are shown
in dark red, and the loci with known causal genes are shown in red. Among
them, three strong selective sweeps were found to be associated with grain
width (b), grain weight (c) and exserted stigma (d), respectively. In b-d, the
logarithm of odds (LOD) values from the composite interval mapping method
are plotted against position on the rice chromosomes. Grey horizontal dashed
line indicates the threshold (LOD > 3.5).
We investigated the genetic patterns around the well-characterized
domestication loci, which were quite different from those at the
whole-genome scale. According to phylogenetic trees calculated from SNPs
on these loci, most cultivars were clustered together, and the Or-III
population from southern China tended to be the closest ancestral
progenitor of all the cultivars (Supplementary Fig. 21). These patterns
were much clearer when expanded to the total regions of the 55
domestication-related loci (Fig. 4a). The indica cultivars were generally
close to Or-I across the genome, but were closer to Or-IIIa than to Or-I
at the 55 selected loci (Supplementary Fig. 22). We further calculated
the genetic distances between cultivated rice and the wild rice populations
from each geographic sampling region through analysis of the
genomic regions around the 55 domestication-related loci
(Supplementary Fig. 23). Our genetic approach showed that the middle
area of the Pearl River in Guangxi province, southern China, was
probably the place of the first development of cultivated rice
(Fig. 4b-d), although an archaeological finding had identified the
Lower Yangtze region in eastern China as one of the centres of rice
cultivation. These results suggest a model in which japonica was
first domesticated from Or-III in southern China, and was subsequently
crossed to local wild rice in South East Asia and South Asia,
thus generating indica after many cross-differentiation-selection cycles
(Fig. 4e). Furthermore, we performed computational simulations to
generate in silico data sets under various demographic scenarios
(Supplementary Fig. 24), using the forward simulator SFS_CODE
(Supplementary Information section 5). We calculated the genetic
Figure 4 | Genetic and geographic origins of rice domestication.

a, Phylogenetic tree of 446 O. rufipogon accessions and 1,083 O. sativa varieties 
calculated from SNPs in the overall regions of the 55 major domestication 
sweeps. b, Geographic locations of 62 O. rufipogon accessions, whose 
phylogenetic positions during domestication are indicated. Colour index 
represents the average of the genetic distance of O. rufipogon accessions to all 
cultivated rice accessions. Two major rivers in southern China are labelled in 
grey in the map. c, The average distance of O. rufipogon accessions from 
different countries to all cultivars. The distance was estimated by simple 
matching distance of SNPs around the Bh4 locus or all SNPs within the 55 
domestication sweeps. d, The average distance of O. rufipogon accessions from 
different provinces in southern China to all cultivars. e, Schematics of the origin 
of cultivated rice. The aus and aromatic rice are minor groups of rice accessions 
with small geographic distributions. 
distances of indica and japonica with different clades of O. rufipogon
for both the real and simulated data (Supplementary Fig. 25), which
showed that there were significant differences between the real data and
the simulated ones in all the scenarios except for the model proposed
here (P < 0.01, rank sum test), providing further support for the model
that we propose.

To investigate why these chromosomal loci were selected during
rice domestication, QTL mapping of domestication-related traits was
performed using a population that was developed from a cross between
O. sativa indica Guangluai-4 and O. rufipogon (Or-IIIa accession
W1943). Using a sequencing-based genotyping method, we sequenced
271 lines in the population with 0.5× coverage for each, and constructed
an ultra-dense genotype map. A total of 15 domestication-related traits
were phenotyped, with 58 QTLs detected using a composite interval
mapping method (Supplementary Fig. 26 and Supplementary Tables
12, 13). For ten QTLs for which causal genes have been reported,
almost all the peak signals were within 200 kb of the known genes,
indicating that a high mapping resolution was achieved
(Supplementary Fig. 27).

Of the 58 QTLs detected, 32 QTLs were located within domestication
sweeps (Fig. 3b-d and Supplementary Figs 28-30). The overlap of
the QTLs for domestication traits with domestication sweeps is highly
significant (P < 0.001, chi-square test). Among them, the QTLs for
traits like exserted stigma (controlling mating system) and grain size
(controlling output yield) show stronger signals than those for
shattering (sh4) and plant architecture (PROG1). Mating systems have
been proposed to play a fundamental part in crop domestication.
We identified three major loci and two minor loci responsible for
exserted stigma, and found that all five QTLs were located within
domestication sweeps.


Identification of domestication-associated variants 


Characterization of domestication-associated genes and their genetic
variation relies on high-quality genome sequences of both the cultivated
species and its immediate progenitor. Hence, we further
sequenced the O. rufipogon accession W1943 with 100-fold genome
coverage and carried out de novo genome assembly. The total length
of the assembly was 406 megabases (Mb) and the N50 length of the
initial contigs was 16 kb. After aligning the assembly against the
reference genome of cultivated rice, we identified 2,621,077 SNPs,
619,132 small indels and 140,075 structural variants of large size
(Supplementary Table 14). We examined the sequence variants for
their potential effects on protein coding, and identified a total of
128,010 non-synonymous SNPs and 49,236 sequence variants with
large effect. Moreover, we surveyed the allele information at all the
polymorphic sites across all rice accessions, and constructed a
comprehensive sequence-variant frequency spectrum map.

We investigated the patterns of sequence polymorphism to detect
variants that affect gene coding and have differential frequency in
different populations. For the domestication loci with well-characterized
genes, this approach allowed us to narrow down the causative
polymorphism to a limited number of sequence variants (Supplementary
Figs 31-34). For the domestication loci for which causal genes have not
yet been identified, we found a total of 273 novel functional variants
within 204 genes that showed high differentiation in allele frequencies
between cultivated and wild rice (Supplementary Table 15). Moreover,
we identified 305 sequence variants in the promoter regions with a
differential frequency between cultivated and wild rice
(Supplementary Table 16), and 1,120 functional variants that were fixed
in either the japonica or the indica panel (Supplementary Tables 17 and 18).


Discussion 

Our study has provided new insights into how and where rice was likely
to be domesticated, and we have identified a set of domestication
sweeps and putative causal genes. Such endeavours will be enhanced
by continuing improvements in assembly and annotation of wild rice
genome sequences, generation of functional genomics data sets of
wild rice, advances in mapping of rice domestication traits, and
biological follow-up of the putative causal genes. The understanding
of past domestication, including the selections on critical traits and the
recent rapid speciation, will further guide future breeding efforts.
Moreover, the great diversity in the wild rice populations, which have
much more natural allelic variation than domesticated rice, will further
facilitate breeding to modify crops in the post-domestication era.


METHODS SUMMARY 


The cultivated and wild rice accessions were all from large collections of rice 
accessions preserved at the China National Rice Research Institute in Hangzhou, 
China, and the National Institute of Genetics in Mishima, Japan. The DNA samples 
were sequenced on the Illumina Genome Analyzer IIx or HiSeq2000. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Sampling and sequencing. The cultivated and wild rice accessions were all from
large collections of rice accessions preserved at the China National Rice Research
Institute in Hangzhou, China, and the National Institute of Genetics in Mishima,
Japan. The accessions were selected on the basis of the germplasm database
records of phenotypic data and sampling localities to maximize genetic and
geographic diversity. The collection was maintained by selfing in the laboratories.
For each accession, genomic DNA from a single plant was used for sequencing,
and seeds derived from the same plant were used for subsequent field trials. In total,
the genomes of 1,083 O. sativa accessions, 446 O. rufipogon accessions and 15
accessions of outgroup species were sequenced on the Illumina Genome Analyzer
IIx generating 73-bp (or 117-bp) paired-end reads, each to approximately onefold
(for O. sativa accessions), twofold (for O. rufipogon accessions) or threefold
(for outgroup species) coverage. The detailed information, including geographic
origin and sequencing coverage of the rice accessions, is listed in
Supplementary Tables 2, 7 and 8. Library construction and sequencing of these accessions
were performed as described. One representative accession of O. rufipogon,
W1943, was sequenced on the Illumina HiSeq2000, generating 100-bp
paired-end reads with 100-fold genome coverage. An amplification-free method of library
preparation was used in deep sequencing of this accession, which reduced the
incidence of duplicate sequences, thus facilitating genome assembly and variation
analysis.

Read alignment and SNP calling. The paired-end reads of all the rice accessions
were aligned against the rice reference genome (IRGSP 4.0) using the software
Smalt (version 0.4) with the parameters ‘-pair 50, 700’ and ‘-mthresh 50’. SNPs
were called using the Ssaha Pileup package (version 0.5) with the detailed procedure
described previously. Genotypes of the rice accessions, including 1,083 O. sativa
accessions, 446 O. rufipogon accessions and 15 accessions of outgroup species,
were further called at the SNP sites from the Ssaha Pileup outputs. The genotype
calls in 15 accessions of outgroup species were used to determine the ancestral
states of SNPs in O. sativa and O. rufipogon. SNPs in coding regions, which were
defined based on the gene models in the RAP-DB (release 2), were then annotated
as synonymous or non-synonymous for calculating the non-synonymous/
synonymous ratio and dN/dS ratio. The genotype data set of the 1,529 rice
accessions (1,083 O. sativa accessions and 446 O. rufipogon accessions) was
generated on the basis of the calls in each rice accession. Seven sets of genome
sequences, which included bacterial-artificial-chromosome-based Sanger sequences
and high-coverage resequencing data, were used to assess the accuracy of the
genotype data sets (Supplementary Table 6). The wild rice accessions with sequencing
coverage >9 were selected to investigate the heterozygosity based on the
overlapped reads that were aligned onto the reference sequence. For each accession, the
proportion of heterozygous genotypes was calculated at the polymorphic sites
(Supplementary Table 3).

Population genetics analysis. The software Haploview was used to calculate
linkage disequilibrium with default settings, using SNPs with information in
446 O. rufipogon accessions. Pairwise r² was calculated for all the SNPs and
then averaged across the whole genome. The matrix of pairwise genetic distance
derived from simple SNP-matching coefficients was used to construct phylogenetic
trees using the software PHYLIP (version 3.66). The software TreeView and
MEGA5 were used for visualizing the phylogenetic trees. Principal component
analysis of the SNPs was performed using the software EIGENSOFT. The
sequence diversity statistics (π) and the population-differentiation statistics (FST)
were computed using a 100-kb window. The value of π was calculated for each
group in O. rufipogon and O. sativa, respectively, and the ratio of π in the full
population (or each clade) of O. rufipogon to that in the full population (or
corresponding subspecies) of O. sativa was used to detect selective sweeps. The genomic
regions where both O. rufipogon and O. sativa show a low level of genetic diversity
were excluded from further analysis. To adopt appropriate thresholds to reduce the
false-positive rate but also retain true selection signals, thresholds were chosen on
the basis of both whole-genome permutation tests and signals at known loci.
Permutation tests were performed to estimate the genome-wide type I error rate
and determine the threshold to call selective sweeps (see Supplementary
Information section 2 for details). The method cross-population extended
haplotype homozygosity (XP-EHH) was also tested for detecting selective sweeps using
the software xpehh (http://hgdp.uchicago.edu/Software/) (Supplementary Fig. 20).
The genetic distance between two clades was computed based on the matrix of
pairwise genetic distance, where the distances of all pairs of accessions from the two
clades were retrieved and averaged. A custom Perl script was developed to plot all
O. rufipogon accessions, using the public geographic information of world borders
from the ‘Thematic Mapping’ data set (version 0.3). The computational
simulations under different demographic scenarios were performed using the program
SFS_CODE.
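
The clade-to-clade distance described above (simple matching distance, averaged over all pairs of accessions drawn from the two clades) might be sketched as follows. The genotype encoding, the handling of missing calls and the function names are assumptions made for illustration only.

```python
def simple_matching_distance(g1, g2):
    """Fraction of mismatching genotypes over sites called in both accessions.

    Genotypes are sequences of allele codes, with None for a missing call.
    Returns None when the two accessions share no called site.
    """
    compared = mismatched = 0
    for a, b in zip(g1, g2):
        if a is None or b is None:
            continue
        compared += 1
        if a != b:
            mismatched += 1
    return mismatched / compared if compared else None


def clade_distance(clade_a, clade_b):
    """Average pairwise distance over all accession pairs from two clades."""
    distances = []
    for g1 in clade_a:
        for g2 in clade_b:
            d = simple_matching_distance(g1, g2)
            if d is not None:
                distances.append(d)
    return sum(distances) / len(distances) if distances else None


# Toy example: two wild accessions compared with one cultivar at four SNPs.
wild_clade = [[0, 0, 1, 1], [0, 1, 1, None]]
cultivars = [[0, 0, 0, 0]]
average = clade_distance(wild_clade, cultivars)  # mean of 2/4 and 2/3
```

Skipping sites with a missing call in either accession is one reasonable choice for low-coverage data; the Methods do not state how missing calls were treated.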


Planting, crossing and phenotyping. For the O. rufipogon population,
approximately five seeds for each accession from the collection of wild rice were
germinated and planted in the experimental field (in Sanya, China at N 18.65°, E
109.80°) from March 2011. The leaf sheath colour was observed and scored
directly and the tiller angle was measured for each plant. The mapping population
of 210 backcross inbred lines (BILs) and 61 chromosome segment substitution
lines (CSSLs) was derived from a cross between O. sativa ssp. indica cv.
Guangluai-4 and O. rufipogon accession W1943. The BILs were developed by
one generation of backcross to Guangluai-4 followed by six generations of
self-fertilization. The CSSLs were developed by five generations of backcross to
Guangluai-4 followed by three generations of self-fertilization. Phenotyping
was conducted in the experimental field (in Shanghai, China at N 31.13°, E
121.28°) from May to October, 2011. The fifteen traits that we phenotyped for
this study include germination rate, tiller angle, heading date, stigma colour, the
degree of stigma exsertion, plant height, panicle length, the degree of shattering,
awn length, grain number per panicle, grain length, grain width, grain weight per
1,000 grains, hull colour and pericarp colour. The degree of stigma exsertion was
scored based on the observation of ~20 randomly sampled spikelets of each line,
on a scale of 1-3 (no, incomplete or complete exsertion). Seed germination rate
was measured by using mature seeds, which were placed in plastic Petri dishes
kept at 30°C in the dark for 48 h. Other traits were phenotyped and scored as
described previously.

Imputation and association analysis. For the genotype data set in O. rufipogon,
genotypes of 446 O. rufipogon accessions were called specifically at the ~5 million
SNP sites that were polymorphic in the O. rufipogon population. In the panel for
GWAS, only the SNPs that have a minor allele frequency (MAF) of more than 5%
and contain genotype calls of more than 100 accessions were retained for subsequent
imputation. The k-nearest neighbour algorithm-based imputation method was
used for inferring missing calls. The specificity of the genotype data set before
and after imputation was assessed using three sets of genome sequences
(Supplementary Table 6). Association analysis was conducted using the compressed
mixed linear model. The top five principal components were used as fixed effects
and the matrix of genetic distance was used to model the variance-covariance
matrix of the random effect. Permutation tests were used to define the threshold of
association signals of the GWAS in the wild rice population. A total of 20
permutation analyses were performed (10 independent permutation tests for each of
the two traits, sheath colour and tiller angle), which resulted in two ‘association
signals’ with the thresholds we set. Hence, there was an average of 0.1 false
positives (that is, two false positives in total in 20 permutation tests) in a single
whole-genome scanning analysis. Simulation tests were used to compare the
performance of GWAS between the populations of cultivated and wild rice.
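
As a rough illustration of the SNP filter and the false-positive arithmetic described above, the sketch below assumes homozygous genotypes coded 0 (reference) or 1 (alternate), with None for a missing call. The function names are hypothetical; the thresholds are the ones given in the text.

```python
def minor_allele_frequency(genotypes):
    """MAF at one biallelic site; genotypes hold 0, 1 or None (missing)."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    alt_freq = sum(called) / len(called)
    return min(alt_freq, 1.0 - alt_freq)


def keep_snp(genotypes, min_maf=0.05, min_called=100):
    """Apply the two stated filters: MAF above 5% and calls in more than 100 accessions."""
    n_called = sum(g is not None for g in genotypes)
    return n_called > min_called and minor_allele_frequency(genotypes) > min_maf


def false_positives_per_scan(n_signals, n_permutations):
    """Average number of spurious signals per whole-genome scan."""
    return n_signals / n_permutations


# Two spurious signals across 20 permutation analyses give 0.1 per scan.
rate = false_positives_per_scan(2, 20)
```

Only the retained SNPs would then be passed to imputation; the k-nearest neighbour imputation step itself is not sketched here.
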

Genotyping and linkage analysis. Genomic DNA of each line in the mapping
population was sequenced on the Illumina Genome Analyzer IIx, each to
approximately 0.5× coverage. Both parents of the population, Guangluai-4 and
W1943, were sequenced with at least 20× genome coverage, in a previous work
and in this study, respectively. SNP identification between parents was conducted
as described previously. Genotype calling, recombination breakpoint determination
and bin map construction were performed using the software SEG-Map
(http://www.ncgr.ac.cn/software/SEG/). QTL analysis of the fifteen traits was
conducted with the composite interval mapping (CIM) method implemented in the
software Windows QTL Cartographer (version 2.5) with a window size of 10 cM
and a step size of 2 cM. QTLs with LOD values higher than 3.5 were called, and
each was located by the position of its LOD peak. The phenotypic
effect (r²) of each QTL was computed using Windows QTL Cartographer. QTLs
located within selective sweep regions were further used to associate the selected
regions with their biological functions. It should be noted that we adopted a
stringent threshold in the QTL calling (LOD > 3.5), and the genomic regions with
LOD ranging from 2.5 to 3.5 may include many minor QTLs (the threshold was set
to 2.5 in most studies).
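
The effect of the LOD threshold on QTL calling can be illustrated with a simplified sketch. It is not the procedure implemented in Windows QTL Cartographer: here a QTL is taken to be any contiguous run of scan positions above the threshold and is reported at its LOD peak. The function name and the toy LOD profile are hypothetical.

```python
def call_qtls(positions, lod, threshold=3.5):
    """Return one (peak_position, peak_lod) per contiguous run with LOD > threshold.

    positions: scan positions along one chromosome, in order
    lod: LOD score at each position
    """
    qtls = []
    peak = None  # best (position, LOD) in the current run, if any
    for pos, score in zip(positions, lod):
        if score > threshold:
            if peak is None or score > peak[1]:
                peak = (pos, score)
        elif peak is not None:
            qtls.append(peak)
            peak = None
    if peak is not None:
        qtls.append(peak)
    return qtls


# Toy LOD profile scanned every 2 cM.
positions = [0, 2, 4, 6, 8, 10, 12, 14]
lod = [1.0, 3.6, 5.2, 4.0, 2.0, 2.8, 3.0, 1.5]
stringent = call_qtls(positions, lod)               # threshold 3.5
relaxed = call_qtls(positions, lod, threshold=2.5)  # also picks up a minor peak
```

On this toy profile the stringent threshold reports only the peak at 4 cM, whereas the relaxed threshold additionally reports the minor peak at 12 cM, which mirrors the remark that regions with LOD between 2.5 and 3.5 may contain minor QTLs.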

Genome assembly and contig anchoring. The genome of W1943 was assembled
by using a custom pipeline integrating Phusion2 (clustering the raw reads into
different groups) and Phrap (then assembling all the reads in each group to
generate contigs). The N50 length of the entire assembly was calculated for the
initial contigs with small contigs of <200 bp excluded. All the full-length
complementary DNA sequences of W1943 were aligned with the final assembly of
W1943 genome sequence using the software GMAP (version 6) with the
parameters ‘-K 15000’ and ‘-k 0.97’. The resulting contigs from whole-genome de novo
assembly were anchored to the rice reference genome sequence (IRGSP 4.0) using
the software MUMmer (version 3).
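
For readers unfamiliar with the N50 statistic, a minimal sketch of its calculation is given below, using the 200-bp minimum contig length stated above. The function name and the toy contig lengths are illustrative only.

```python
def n50(contig_lengths, min_length=200):
    """N50 of the contigs that are at least min_length long.

    N50 is the length L such that contigs of length >= L together hold
    at least half of the total assembled bases.
    """
    kept = sorted((n for n in contig_lengths if n >= min_length), reverse=True)
    half = sum(kept) / 2
    running = 0
    for length in kept:
        running += length
        if running >= half:
            return length
    return 0  # no contig passed the length filter


# Toy assembly: the 50-bp contig is excluded before the calculation.
toy_n50 = n50([50, 200, 300, 400, 500, 600])
```

In the toy example the retained contigs total 2,000 bp, and the two longest (600 bp and 500 bp) already cover more than half of that, so the N50 is 500 bp.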

Genome annotation and variant detection. Gene models of the genome of the
wild rice W1943 were predicted using the software Fgenesh that was set for a
monocot model (version 2.0). The resulting proteome of W1943 was compared
with protein sequences in Rice Genome Annotation Project (version 7.0) using
BLASTP with a cutoff of a minimum of 95% identity. Sequence variants, including
SNPs, indels and imbalanced substitutions, were called using the diffseq program
in the EMBOSS package. Indels of large size were called from the alignment
results of MUMmer. Effects of the sequence variants were predicted according
to the gene models of Nipponbare in the RAP-DB (release 2) across the rice
genome. For indels in genic regions and SNPs with large effect around the
domestication loci, the effects were mainly based on the reference gene models.

Population-scale sequence comparison. The sequence reads of 1,083 O. sativa
accessions, 446 O. rufipogon accessions and 15 outgroup accessions were then
aligned against assembled genome sequences of W1943 using the same parameters
as those used against the reference Nipponbare genome sequences. Genotypes of each
accession were called at all sequence variant sites (including SNPs, indels and
imbalanced substitutions that were detected from assembled sequences), based
on the alignment outputs against the two genome sequences. The allele frequencies
at the sequence variant sites were calculated for each clade of O. sativa and O.
rufipogon. In each clade, variant sites with information from fewer than 10 accessions
(fewer than 2 for the outgroups) were excluded from the allele-frequency calculation
and recorded as having no data available.
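
The per-clade allele-frequency rule described above can be sketched as follows, assuming genotypes coded 0 (reference allele), 1 (alternate allele) or None (missing). The function name is hypothetical, and None is used here to stand for 'no data available'.

```python
def allele_frequency(genotypes, min_called=10):
    """Alternate-allele frequency in one clade, or None for 'no data'.

    genotypes: 0 (reference), 1 (alternate) or None (missing), one per accession
    min_called: minimum number of accessions with a call
                (10 for rice clades, 2 for the outgroups)
    """
    called = [g for g in genotypes if g is not None]
    if len(called) < min_called:
        return None
    return sum(called) / len(called)


# Twelve called accessions, three carrying the alternate allele.
freq = allele_frequency([1] * 3 + [0] * 9 + [None] * 5)
# Too few calls in this clade, so the site is reported as having no data.
missing = allele_frequency([1] * 5 + [None] * 20)
# Outgroup rule: two calls are enough.
outgroup = allele_frequency([1, 0, None], min_called=2)
```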
Compensatory dendritic cell development
mediated by BATF-IRF interactions

Peng Li, Wei Liao, Michael Behnke, Samuel S. K. Lam, Cora T. Aurthur, Warren J. Leonard, Harinder Singh,
Christina L. Stallings, L. David Sibley, Robert D. Schreiber & Kenneth M. Murphy


The AP1 transcription factor Batf3 is required for homeostatic development of CD8α+ classical dendritic cells that prime
CD8 T-cell responses against intracellular pathogens. Here we identify an alternative, Batf3-independent pathway in
mice for CD8α+ dendritic cell development operating during infection with intracellular pathogens and mediated by the
cytokines interleukin (IL)-12 and interferon-γ. This alternative pathway results from molecular compensation for Batf3
provided by the related AP1 factors Batf, which also functions in T and B cells, and Batf2 induced by cytokines in response
to infection. Reciprocally, physiological compensation between Batf and Batf3 also occurs in T cells for expression of 
IL-10 and CTLA4. Compensation among BATF factors is based on the shared capacity of their leucine zipper domains to 
interact with non-AP1 factors such as IRF4 and IRF8 to mediate cooperative gene activation. Conceivably, manipulating 
this alternative pathway of dendritic cell development could be of value in augmenting immune responses to vaccines. 


BATF and BATF3 are activator protein 1 (AP1) transcription
factors with immune-specific functions. Batf is required for
development of T-helper cells producing IL-17 (TH17) and follicular
helper T (TFH) cells, and class-switch recombination (CSR) in B
cells. Batf3 is required for development of CD8α+ classical dendritic
cells (cDCs) and related CD103+ dendritic cells that cross-present
antigens to CD8 T cells and produce IL-12 in response to pathogens.

We recently recognized a heterozygous phenotype for Batf3 of 50%
fewer CX3CR1 CD8α+ cDCs in Batf3+/− mice, indicating that
levels of BATF3 are limiting for CD8α+ dendritic cell development
under homeostatic conditions. While analysing Toxoplasma gondii
infection in Batf3−/− mice, we observed evidence indicating
Batf3-independent CD8α+ cDC development based on apparent priming
of pathogen-specific CD8 T cells in Batf3−/− mice treated with IL-12.
Here we report a Batf3-independent pathway of CD8α+ cDC development
that functions physiologically during infection by intracellular
pathogens and describe its molecular basis, which involves
compensatory BATF factors.


Pathogens and IL-12 restore CD8α+ cDCs in Batf3−/− mice


IL-12 administration to Batf3−/− mice before infection with type II
Prugniaud (Pru) T. gondii reversed their susceptibility by inducing
interferon (IFN)-γ production not only from natural killer (NK) cells
but also from CD8 T cells, indicating potentially restored
cross-priming. To test this idea, we infected Batf3−/− mice with the
attenuated RHΔku80Δrop5 strain of T. gondii and examined CD8α+
cDCs (Supplementary Fig. 1a). Notably, CD8α+ cDCs reappeared in
spleens of Batf3−/− mice by day 10 after infection. Infection by
Listeria monocytogenes also restored CD8α+ cDC production in
Batf3−/− mice (Supplementary Fig. 1b). Aerosol-mediated infection
with Mycobacterium tuberculosis caused a progressive restoration
of CD8α+ cDCs in Batf3−/− mice and expanded CD8α+ cDCs in
wild-type mice (Fig. 1a). Mycobacterium tuberculosis infection of
Batf3−/− mice restored the missing lung-resident CD103+ cDCs
(DEC205+CD24+CD4−SIRP-α−CD11b−) (Supplementary Fig. 1c).
Batf3−/− mice showed no difference in survival to M. tuberculosis
infection compared to wild-type mice (Supplementary Fig. 1d),
suggesting that the initial lack of CD8α+ cDCs and peripheral
CD103+ cDCs was not sufficient for lethality with M. tuberculosis.
Because IL-12 is important in control of M. tuberculosis, we measured
serum IL-12 levels in M. tuberculosis-infected wild-type and Batf3−/−
mice (Fig. 1b). IL-12 levels were reduced in Batf3−/− mice in the
first 3 weeks, but increased after 6 weeks to approximately 50% of
the level in M. tuberculosis-infected wild-type mice.

We found that IL-12 administration restored CD8α+ cDC production
in all backgrounds of Batf3−/− mice (Fig. 1c and Supplementary Fig. 2a).
Restored CD8α+ cDCs were functional for cross-presentation
(Fig. 2a and Supplementary Fig. 2b). Purified splenic CD8α+ and
CD4+ cDCs from wild-type or Batf3−/− mice, treated with vehicle or
IL-12, were tested for cross-presentation. As a control, OT-I T cells
proliferated in response to wild-type CD8α+ dendritic cells, but not
CD4+ dendritic cells, co-cultured with ovalbumin-loaded cells with or
without IL-12 treatment (Fig. 2a). Importantly, OT-I T cells proliferated
similarly in response to IL-12-induced Batf3−/− CD8α+ dendritic cells,
but not Batf3−/− CD4+ dendritic cells, as to wild-type CD8α+ cDCs.
The CD8α+ cDCs restored by IL-12 administration in Batf3−/− mice
showed a gene expression profile that was more similar to wild-type
CD8α+ dendritic cells than to CD4+ cDCs (Supplementary Fig. 2c). A
total of 1,855 genes differed more than fourfold in expression between
IL-12-induced CD8α+ cDCs and CD4+ cDCs within Batf3−/− mice.
However, in comparing IL-12-induced CD8α+ cDCs in Batf3−/− mice
with the rare CD8α+ cDCs from Batf3−/− C57BL/6 mice, or with
CD8α+ cDCs in wild-type mice, there were only 34 and 206 genes,
respectively, differing more than fourfold in expression.
Figure 1 | Intracellular pathogens or IL-12 restore lymphoid CD8α+ cDCs
and tissue-resident CD103+ cDCs in Batf3−/− mice. a, Wild type (WT) and
Batf3−/− 129SvEv mice were uninfected (Ctrl) or infected with M. tuberculosis,
and spleens collected and analysed by FACS at the indicated time. Histograms for
indicated markers are gated as autofluorescent−MHCII+CD11c+ cells.

Numbers are per cent of cells in the gate. b, Serum IL-12 level was measured from
individual mice (a) at the indicated time. c, Wild-type and Batf3−/− 129SvEv mice
were treated with vehicle (PBS) or IL-12 and analysed by FACS after 3 days as in a.


The IL-12-induced restoration of CD8α+ cDCs in Batf3−/− mice
was dependent on IFN-γ in vivo (Supplementary Fig. 2d). With
administration of control antibody, IL-12 induced a threefold increase
in CD8α+ cDCs in wild-type mice and restored CD8α+ cDC production
in Batf3−/− mice. Both effects were blocked by anti-IFN-γ
antibody. IL-12 induced IFN-γ production from NK cells but not T
cells (Supplementary Fig. 2e), and IL-12 treatment was able to restore
CD8α+ cDCs in Rag2−/− Batf3−/− mice (Supplementary Fig. 2f),
indicating that T cells and B cells are not required for this effect.

Batf3−/− mice fail to reject highly immunogenic H31m1 and D42m1
fibrosarcomas. However, Batf3−/− mice pre-treated with IL-12 either
fully rejected or showed reduced growth of these fibrosarcomas (Fig. 2b
and Supplementary Fig. 2g). Rejection was not simply due to NK cell
activation, as IL-12 treatment failed to alter tumour growth in Rag2−/−
mice. Notably, IL-12-treated Batf3−/− mice re-established priming of
CD8 T cells capable of infiltrating tumours similar to wild-type mice
(Fig. 2c). Mice with the inactivating IRF8 R249C mutation failed
to restore CD8α+ cDCs upon IL-12 administration (Supplementary
Fig. 3a), indicating that restoration requires functional IRF8 protein.
Furthermore, these mice were highly susceptible to infection by
aerosol-administered M. tuberculosis (Supplementary Fig. 3b), indicating that
Batf3−/− mice may resist M. tuberculosis infection by restoration of
CD8α+ cDCs. Thus, physiological Batf3-independent development of
CD8α+ cDCs is induced by pathogens, mediated by IL-12 and IFN-γ.


Batf, Batf2 and Batf3 cross-compensate 

We next investigated whether Batf could replace Batf3 for in vitro cDC
development (Fig. 3a). CD103+SIRP-α− cDCs do not develop in
Flt3L-treated Batf3−/− bone marrow cultures. Retroviral expression of
Batf3 in Batf3−/− bone marrow progenitors restored CD103+ cDC
development. Notably, Batf expression fully and cell-intrinsically
restored CD103+SIRP-α− cDC development, whereas Fos was
Figure 2 | IL-12-induced CD8α+ cDCs in Batf3−/− mice can cross-present
and mediate tumour rejection. a, From mice in Fig. 1c, dendritic cells were
purified by sorting as CD3−DX5−MHCII+CD11c+SIRP-α−CD24+DEC205+
dendritic cells (CD8DC) and CD3−DX5−MHCII+CD11c+SIRP-α+CD24−DEC205−
dendritic cells (CD4DC) and assayed for cross-presentation.
OT-I proliferation in response to cDCs mixed with the indicated
number of MHC-class-I-deficient ovalbumin (Ova)-loaded splenocytes is
shown. b, Wild-type or Batf3−/− mice treated with vehicle (PBS) or with IL-12
were inoculated with 1 X 10° H31m1 fibrosarcomas. Tumour size in individual 
mice is shown. c, Mice in b were analysed by FACS 11 days after H31m1
inoculation for CD8 T-cell infiltration into tumours.


inactive (Supplementary Fig. 3c). CD103+ cDCs restored by Batf
and Batf3 expression in vitro were functional, showing features of
mature CD103+ cDCs, including loss of SIRP-α and CD11b expression,
upregulation of CD24, and selective production of IL-12 in response
to T. gondii antigen (Supplementary Fig. 3c, d). Reciprocally,
Batf3 but not Fos restored cell-intrinsic IL-17a production by Batf−/−
T cells and CSR in Batf−/− B cells (Supplementary Fig. 3e, f). Thus,
Batf and Batf3 can molecularly cross-compensate for several distinct
lineage-specific functions, activities not shared by Fos.

We also identified in vivo compensation between Batf and Batf3 in
dendritic cells. On the 129SvEv and BALB/c backgrounds, Batf3−/−
mice completely lack CD8α+ cDCs (Fig. 3b and Supplementary Fig. 2a).
In contrast, C57BL/6 Batf3−/− mice retain a 3-7% population of CD8α+
cDCs in spleen and unexpectedly retain a completely normal population
of CD8α+ cDCs in skin-draining inguinal lymph nodes (ILNs)
(Fig. 3b). These CD8α+ cDCs are functional, as C57BL/6 Batf3−/− mice
showed robust priming of CD8 T cells against herpes simplex virus
(HSV) infection in the footpad, whereas 129SvEv Batf3−/− mice, lacking
CD8α+ cDCs in ILNs, did not (Supplementary Fig. 3g). However,
C57BL/6 Batf−/− Batf3−/− mice lack CD8α+ dendritic cells in ILNs
and fail to prime T cells against HSV. Thus, retention of functional
CD8α+ cDCs in ILNs of C57BL/6 Batf3−/− mice is Batf-dependent.

Batf and Batf3 also compensate in expression of genes by T cells.
IL-4 and IL-10 production was not substantially affected in either
Batf−/− or Batf3−/− TH2 cells (Fig. 3c and Supplementary Fig. 3i-k).
However, IL-10 is reduced at least eightfold in Batf−/− Batf3−/− TH2
Figure 3 | Batf compensates for CD8α+ cDC development in Batf3−/−
mice. a, Wild-type or Batf3−/− bone marrow cells were infected with GFP-RV
(empty) or retrovirus expressing the indicated cDNA and cultured with
Flt3L. Histograms for the indicated markers are for B220− CD11c+ cells on day
10. Numbers are the per cent of cells in the gate. b, Inguinal lymph nodes from
wild-type, Batf3−/− or Batf−/− Batf3−/− mice on 129SvEv or C57BL/6
backgrounds were analysed by FACS. Shown are histograms for DEC205 and
CD8α. c, CD4 T cells of the indicated genotype were differentiated twice under
TH2 conditions and analysed by FACS for intracellular IL-10.


cells. Also, expression of the inhibitory receptor CTLA4 is partially
reduced in Batf−/− TH2 cells but reduced threefold further in
Batf−/− Batf3−/− TH2 cells. In contrast, IFN-γ production by TH1 cells
is unaffected by loss of Batf, Batf3 or both.

We asked whether IL-12-induced restoration of CD8α+ cDCs
in Batf3−/− mice was due to compensation by Batf (Supplementary
Fig. 3h). Restoration of splenic CD8α+ cDCs in IL-12-treated
Batf−/− Batf3−/− mice was reduced to 5% from 11% in IL-12-treated
Batf3−/− mice. Whereas Batf seems to be responsible for roughly half
of the IL-12-induced restoration of CD8α+ cDCs in Batf3−/− mice,
some residual compensation remained in Batf−/− Batf3−/− mice. We
asked if a third factor might compensate for Batf and Batf3. Batf2 (also
called SARI) is closely related to Batf and Batf3 and is induced by
lipopolysaccharide (LPS) and IFN-γ in macrophages and CD103+
dendritic cell populations (Supplementary Fig. 4a-c). We found that
Batf2 was induced by IFN-γ in wild-type and Batf3−/− dendritic cells
derived from Flt3L-cultured bone marrow (Supplementary Fig. 4d, e)
and in vivo by IL-12 in dendritic cells in Batf3−/− mice
(Supplementary Fig. 4f). Induction of Batf2 by IFN-γ in cDCs made it a potential
candidate to mediate IFN-γ-dependent compensation for Batf3.
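
The "roughly half" estimate for Batf in the paragraph above follows from simple arithmetic on the reported percentages (11% in IL-12-treated Batf3-deficient mice versus 5% in IL-12-treated Batf Batf3 double-deficient mice). A minimal sketch in Python, assuming the untreated baseline is small enough to ignore; the function name is illustrative and does not come from the paper:

```python
def fraction_dependent(single_ko_pct: float, double_ko_pct: float) -> float:
    """Fraction of the restored population lost when a second factor is deleted.

    Assumption (not from the paper): the untreated baseline is treated as zero.
    """
    if single_ko_pct <= 0:
        raise ValueError("single-knockout percentage must be positive")
    return (single_ko_pct - double_ko_pct) / single_ko_pct


# Splenic CD8a+ cDCs after IL-12: 11% (Batf3 KO) versus 5% (Batf Batf3 double KO).
batf_share = fraction_dependent(11.0, 5.0)
print(f"Batf-dependent share of restoration: {batf_share:.0%}")
```

The result, about 55%, is consistent with the statement that Batf accounts for roughly half of the restoration, with the remainder left for another factor.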

We generated Batf2−/− mice by targeting exons 1 and 2, eliminating
BATF2 protein expression (Supplementary Fig. 5). Batf2−/− mice
had normal development of NK, T and B cells, pDCs, neutrophils,
resting cDCs, and peritoneal, liver and lung macrophages
(Supplementary Fig. 6a-c). However, Batf2−/− mice displayed significantly
decreased survival after infection by T. gondii (Pru) (Fig. 4a),
although parasite burden and serum cytokine levels were similar to
wild-type mice (Supplementary Fig. 7a, b). Notably, Batf2−/− mice
showed significantly decreased numbers of lung-resident
CD103+ CD11b− dendritic cells and CD103− CD11b− macrophages
after infection (Fig. 4b and Supplementary Fig. 7c). These changes
were specific, as there were no differences in other myeloid subsets
(Supplementary Fig. 7d, e). Thus, Batf2 has a role in maintaining
numbers of Batf3-dependent CD103+ dendritic cells in the lung after
infection with T. gondii.

We next investigated whether Batf2 could compensate for dendritic
cell defects in Batf3−/− mice and for T and B cell defects in Batf−/−
mice (Fig. 4c and Supplementary Fig. 8a, b). Like Batf, retroviral
expression of Batf2 restored development of CD103+ SIRP-α− dendritic
cells in Flt3L-treated Batf3−/− bone marrow. Unlike Batf and Batf3,
expression of Batf2 did not restore TH17 development in Batf−/− T cells
and only weakly restored CSR in Batf−/− B cells. Thus, Batf2 selectively
compensates for Batf and Batf3 in cDCs but not in T or B cells. We next
Figure 4 | Batf2 compensates for Batf3 in CD8α+ and CD103+ cDC
development during T. gondii infection. a, Wild-type and Batf2−/− mice
were infected with T. gondii and monitored for survival. n = 29 for wild-type
(dashed line) and Batf2−/− (solid line) mice. b, Shown are percentages of lung
CD103+ dendritic cells of total CD45.2+ cells for uninfected and infected (T.
gondii) wild-type or Batf2−/− mice on day 10. n = 5 from one of three
experiments. c, Wild-type or Batf3−/− bone marrow cells were infected with the
indicated retrovirus, cultured with Flt3L and analysed by FACS on day 10.
d, Groups of five mice each of the indicated genotypes were treated with vehicle
(PBS) or IL-12 and analysed by FACS after 3 days. Shown are percentages and
s.e.m. of CD8α+ cDCs as a total of splenic cDCs.
examined in vivo IL-12-induced restoration of CD8α+ cDCs in
Batf2−/−, Batf3−/− and Batf2−/− Batf3−/− mice (Fig. 4d and
Supplementary Fig. 8c). IL-12 treatment restored CD8α+ cDCs from
<1% to >5% of total cDCs in Batf3−/− mice, but this was significantly
reduced to ~2% in Batf2−/− Batf3−/− mice. This suggests that Batf2 is
responsible for roughly half of IL-12-induced CD8α+ cDC restoration
in Batf3−/− mice. We found similar results with in vitro CD103+ cDC
development. Whereas granulocyte-macrophage colony-stimulating
factor (GM-CSF) restored only CD103 and not DEC205 expression in
Flt3L-treated Batf3−/− bone marrow cultures, adding IFN-γ with
GM-CSF restored DEC205+ CD103+ CD11b− dendritic cells
(Supplementary Fig. 8d, e). We used this system to analyse compensation
between Batf, Batf2 and Batf3 (Supplementary Fig. 8d, e). Relative to
wild-type bone marrow, CD103+ DEC205+ CD11b− cDCs were
partially reduced in Batf−/−, Batf2−/− and Batf3−/− singly deficient
bone marrow, but were reduced to an even greater extent in both
Batf−/− Batf3−/− and Batf2−/− Batf3−/− bone marrow relative to all
single-deficient bone marrow cultures. Collectively, these results
suggest that Batf and Batf2 both act in the cytokine-dependent rescue
of CD8α+ cDC development in Batf3−/− mice.


The BATF LZ domain interacts with IRF4 and IRF8 


To understand BATF cross-compensation, we analysed chimaeric
proteins containing fused domains of BATF, BATF2 and FOS
(Supplementary Fig. 9). FOS inactivity could result from inhibitory
actions of amino- and carboxy-terminal domains missing in BATF
and BATF3 (Fig. 5a). However, removing these domains from FOS
(Δ5′FOSΔ3′) or fusing the BATF DNA-binding domain (DBD) with
the FOS leucine zipper (LZ) (BBRFFΔ3′) failed to restore CD103+
cDC development (Fig. 5a), TH17 development or CSR activity
(Supplementary Fig. 10a, b). However, fusing the BATF LZ with the
FOS DBD (Δ5′FFQBB) fully restored CD103+ cDC development in
Flt3L-treated Batf3−/− bone marrow cultures, fully restored CSR in
Batf−/− B cells and exhibited 20% of wild-type activity for restoring
IL-17 production. Furthermore, BATF with a C-terminal GFP
(BATF-GFP) or FOS (BBBF) domain had >40% of wild-type
BATF activity in all three assays (Supplementary Fig. 11). Thus, the
LZ domain, not the DBD, determines BATF specificity.

Removing the BATF2 C-terminal domain (BATF2 DM) allowed
for full restoration of CD103+ cDC development, but led to only
partial restoration of CSR activity (~15%), and no restoration of
TH17 development (Supplementary Fig. 10c-h). Fusing the BATF
DBD with the BATF2 LZ (B1HB2 DM) allowed for approximately
50% of wild-type BATF activity for CSR and TH17 development
in Batf−/− B and T cells. Fusing the BATF2 DBD with BATF LZ
(B2QB1) allowed for 50% of wild-type BATF activity in CD103+
cDC development, but provided less than 5% wild-type BATF
activity in TH17 development. Thus, the BATF2 DBD restricts activity
in T and B cells, but its LZ functions for all BATF lineage-specific
activities.
Figure 5 | BATF leucine zipper interactions with non-AP1 factors mediate
lineage-specific actions. a, Structures of chimaeric proteins are shown below a
diagram of FOS. DNA binding domain (DB), hinge (H), leucine zipper (LZ), N
(5′) and C termini (3′) are indicated. Flt3L-treated wild-type or Batf3−/− bone
marrow cells infected with the indicated retrovirus were analysed after 10 days.
b, 293FT cells expressing both Batf and Irf4 (upper panel) or Batf and Irf4 as
indicated (lower panel) were analysed by EMSA with the indicated probes and
antibodies. c, 293FT cells expressing IRF4 (+) and the indicated BATF
chimaera were analysed by EMSA with the AICE1 probe. d, B cells were
analysed by EMSA with the indicated probe and competitor oligonucleotides
(comp). e, Batf3−/− bone marrow cells infected with the indicated
retroviruses encoding BATF were analysed as in a. f, 293FT cells expressing the
indicated BATF mutants were analysed by EMSA with the AICE1 probe.
A recent study proposed that IRF4 and BATF may interact.
Correspondence between the phenotypes of Batf−/− with Irf4−/−
mice, lacking TH17 development and CSR, and Batf3−/− with
Irf8−/− mice, lacking CD8α+ cDC development (Supplementary
Fig. 12), suggested that BATF and BATF3 may cooperate with
both IRF4 and IRF8. Furthermore, ChIP-seq analysis of BATF and
IRF4 revealed coincident binding to composite AP1 and IRF
motifs. By analogy with the ETS-IRF composite elements
(EICE), the AP1-IRF composite elements are designated as
AICE. Recalling the FOS LZ interaction with NF-AT that mediates
cooperative binding to Il2 regulatory regions, we therefore asked
whether the BATF LZ interacted with non-AP1 factors, including
IRF4.

Electrophoretic mobility shift assays (EMSAs) demonstrated 
interactions between BATF and both IRF4 and IRF8 (Fig. 5 and 
Supplementary Figs 13 and 14). The BATF-JUN complex that formed 
on an AP1 consensus probe was unchanged by addition of IRF4 or
IRF8. Its abundance was increased by additional JUNB (Supplemen- 
tary Fig. 13a). However, using an AICE from the Ctla4 locus, a slower 
mobility complex formed with addition of either Irf4 or Irf8, which 
required the IRF consensus element (Fig. 5b and Supplementary 
Fig. 13b). This BATF-JUN-IRF4 complex required both BATF and 
IRF protein, and IRF4 was unable to bind the AICE1 probe without 
BATF (Fig. 5b). In contrast, IRF4 was able to bind to an ETS-IRF 
consensus element (EICE) in the presence of PU.1 independently of
BATF (Fig. 5b). The BATF-JUN-IRF4 complex had similar mobility 
to the FOS-JUN complex in activated cells, but could be distinguished 
by antibody supershifts (Supplementary Fig. 13c). Spatial constraints 
on the BATF-IRF4 interaction are suggested by lack of BATF-IRF4 
complex formation on the AICE2 probe with reversed orientation of 
the AP1 and IRF elements (Supplementary Fig. 13a, c).

The ability of chimaeric BATF proteins to interact with IRF4 
correlated with their functional activity (Fig. 5c). The active 
Δ5′FFQBB interacted with IRF4, but the inactive Δ5′FOSΔ3′ and
BBRFFΔ3′ did not (Fig. 5c), although all three bound an AP1
consensus (Supplementary Fig. 13d). A BATF-IRF complex also 
formed in B and T cells, requiring both the IRF and AP1 consensus 
elements (Fig. 5d and Supplementary Fig. 14c), and involved contri- 
bution by endogenous IRF4 and IRF8 (Supplementary Fig. 14a, b, d). 
The BATF-IRF complex was completely absent in Batf−/− B cells, but
was partially retained in single-deficient Irf4−/− or Irf8−/− B cell
extracts (Supplementary Fig. 14a, b). Supershifts using anti-IRF4
and anti-IRF8 antibodies in Irf4−/− or Irf8−/− B cell nuclear extracts
showed that the endogenous BATF-IRF complex is composed of a 
mixture of IRF4 and IRF8 (Supplementary Fig. 14b). Similarly,
supershifts of primary TH17 cells demonstrate an interaction between
BATF and both IRF4 and IRF8 (Supplementary Fig. 14d). 

We identified BATF LZ mutations which abrogated both function 
and IRF4 interactions (Fig. 5e, f). Positions b, c and f of the FOS LZ 
face away from the parallel coiled-coil of the JUN LZ and mediate 
interactions with NF-AT (Supplementary Fig. 9d). Several BATF
mutations in these positions had no effect on activity (Supplemen- 
tary Table 1 and Supplementary Fig. 9c). However, four residues 
together controlled nearly all BATF-specific activity (Fig. 5e and 
Supplementary Fig. 15). BATF(L56A), BATF(K63D) and BATF(E77K)
showed no reduction in CD103+ cDC development, and retained
>60% of wild-type BATF activity in CSR. In contrast, BATF(H55Q) 
activity was reduced by nearly 70% (Fig. 5e). Double mutants 
BATF(K63D/E77K) and BATF(H55Q/L56A) were reduced by 50% 
and 75%, but quadruple mutant BATF(H55Q/L56A/K63D/E77K) 
had less than 10% of wild-type BATF activity in CD103+ cDC development
(Fig. 5e). This loss of activity was specific, as two other quadruple
mutants—BATF(E59Q/D60Q/K63D/E77K) and BATF(K63D/R69Q/ 
K70D/E77K)—maintained >50% of wild-type BATF activity 
(Supplementary Fig. 15c, d). A similar pattern was seen for CSR in 
Batf−/− B cells (Supplementary Fig. 15b). The activity of these mutants
correlated with IRF4 interaction (Fig. 5f). Wild-type BATF and
functional mutant BATF(K63D/E77K), and both functional quadruple
mutants, formed a complex with IRF4, whereas the less active 
BATF(H55Q) and BATF(H55Q/L56A), and the completely inactive 
BATF(H55Q/L56A/K63D/E77K), did not, although all were stably 
expressed and could bind an AP1 site (Supplementary Fig. 13e, f).


Discussion 


We have uncovered a cytokine-driven pathway for expansion of functional
CD8α+ cDCs occurring physiologically during infection by
intracellular pathogens. This pathway relies on functional compensation
for Batf3 provided by Batf and Batf2 through a shared specificity
defined by the BATF LZ domain to support CD8α+ cDC and TH17
development, and CSR in B cells. This compensatory pathway of
CD8α+ cDC development may provide a basis for augmenting
therapeutic immune responses. The basis of this shared specificity
is the LZ interaction with non-AP1 factors, including IRF4 and
IRF8, extending the repertoire of AP1 interactions beyond the
recognized association of FOS with NF-AT. The ability of both
BATF and BATF3 to support both IRF4- and IRF8-dependent lineage
activities implies that the specificity for gene activation is determined
largely by the IRF factors. An important next step will be to determine
how regulatory elements discriminate between these distinct complexes.


METHODS SUMMARY 

Mice. Batf−/−, Batf2−/− and Batf3−/− mice and intercrosses were bred in our
specific pathogen-free animal facility according to institutional guidelines.
Irf8−/− mice from the European Mutant Mouse Archive and BXH2/TyJ mice
from The Jackson Laboratory were maintained on the C57BL/6 background.
Germline-deleted Irf4−/− mice were produced from crossing
B6.129S1-Irf4tm1Rdf/J mice obtained from The Jackson Laboratory with a CMV-Cre deleter
strain on the C57BL/6 background. Experiments were in accordance with 
procedures approved by the AAALAC accredited Animal Studies Committee 
of Washington University in St Louis. 

In vivo IL-12 treatment. Recombinant murine IL-12 (Peprotech and Pfizer) was
suspended in pyrogen-free saline at 2.5 µg ml−1. Mice were injected intraperitoneally
with 0.5 µg of IL-12 two times 3 days before analysis. For tumour transplantation and
tumour collection experiments, mice were treated on days 4, 3 and 1 before tumour
inoculation and 24 h after tumour inoculation.
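
The injection volume implied by this protocol can be worked out from the stock concentration and the dose. A minimal sketch, reading the stock as 2.5 µg per ml and the dose as 0.5 µg per injection (the units are partly damaged in the extracted text, so this reading is an assumption); the function name is illustrative:

```python
def injection_volume_ml(dose_ug: float, stock_ug_per_ml: float) -> float:
    """Volume of stock solution (ml) that delivers the requested dose (ug)."""
    if stock_ug_per_ml <= 0:
        raise ValueError("stock concentration must be positive")
    return dose_ug / stock_ug_per_ml


# Assumed reading of the protocol: 0.5 ug IL-12 from a 2.5 ug/ml stock.
volume = injection_volume_ml(dose_ug=0.5, stock_ug_per_ml=2.5)
print(f"{volume:.2f} ml per intraperitoneal injection")
```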


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Generation of Batf2−/− mice. The targeting construct was assembled by
Gateway recombination cloning system (Invitrogen). To construct pENTR
loxFRT rNEO, a floxed PGK-neo^r (phosphoglycerate kinase promoter-neomycin
phosphotransferase) gene cassette (1,982 bp) was excised from the pLNTK
targeting vector using SalI and XhoI. After incubation with Easy-A cloning
enzyme (Stratagene) and dNTPs to generate 3′-A overhangs, the PGK-neo^r
cassette was ligated into the multiple cloning site of pGEM-T Easy (Promega).
The resulting 2,022-bp PGK-neo^r gene cassette was released using NotI and
ligated into the 2,554-bp backbone of NotI-digested pENTR lox-Puro.
Flippase Recognition Target (FRT) sites were sequentially inserted at the SacII
and HindIII sites using DNA fragments generated by annealing the following
oligonucleotides: SacII-FRT-A (5'-GAAGTTCCTATTCCGAAGTTCCTATTC 
TCTAGAAAGTATAGGAACTTCCGC-3’) and SacII-FRT-B (5'-GGAAGT 
TCCTATACTTTCTAGAGAATAGGAACTTCGGAATAGGAACTTCGC-3’), 
or HindIII-FRT-A (5'-AGCTTGAAGTTCCTATTCCGAAGTTCCTATTCTC 
TAGAAAGTATAGGAACTTC-3’) and HindIII-FRT-B (5'-AGCTGAAGTTC 
CTATACTTTCTAGAGAATAGGAACTTCGGAATAGGAACTTCA-3’). 
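
The two oligonucleotide pairs listed above can be checked for complementarity in a few lines. The sketch below is not part of the published methods; it assumes that each pair anneals into a duplex with short single-stranded ends matching the named enzyme (a 2-nt 3′ overhang for SacII, a 4-nt 5′ overhang for HindIII). The function names are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def anneals(top: str, bottom: str, overhang: int, end: str) -> bool:
    """Check that two oligos pair fully apart from `overhang` nt at each end.

    end="3": the unpaired bases sit at the 3' end of each oligo.
    end="5": the unpaired bases sit at the 5' end of each oligo.
    """
    if end == "3":
        core_top = top[: len(top) - overhang]
        core_bottom = bottom[: len(bottom) - overhang]
    elif end == "5":
        core_top = top[overhang:]
        core_bottom = bottom[overhang:]
    else:
        raise ValueError("end must be '3' or '5'")
    return reverse_complement(core_top) == core_bottom


# Sequences copied from the text, written 5' to 3'.
SACII_FRT_A = "GAAGTTCCTATTCCGAAGTTCCTATTC" "TCTAGAAAGTATAGGAACTTCCGC"
SACII_FRT_B = "GGAAGT" "TCCTATACTTTCTAGAGAATAGGAACTTCGGAATAGGAACTTCGC"
HINDIII_FRT_A = "AGCTTGAAGTTCCTATTCCGAAGTTCCTATTCTC" "TAGAAAGTATAGGAACTTC"
HINDIII_FRT_B = "AGCTGAAGTTC" "CTATACTTTCTAGAGAATAGGAACTTCGGAATAGGAACTTCA"

print(anneals(SACII_FRT_A, SACII_FRT_B, overhang=2, end="3"))
print(anneals(HINDIII_FRT_A, HINDIII_FRT_B, overhang=4, end="5"))
```

A check of this kind is a quick way to catch transcription errors in long oligonucleotide sequences before ordering or re-using them.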

To construct pENTR-BATF2-5HA, the 5’ homology arm was generated by 
PCR from genomic DNA using the following oligonucleotides, which contain the 
attB4 and attB1r sites: 5’-GGGGACAACTTTGTATAGAAAAGTTGGCAGGCT
GAAGCAGGGACAC-3’, and 5'-GGGGACTGCTTTTTTGTACAAACTTGC 
CCTAGCACACCCAGTTTCAGTTTC-3’. The attB4-attB1r PCR fragment
was ligated into pDONR(P4-P1R) plasmid (Invitrogen) by BP recombination 
reaction. To construct pENTR-BATF2-3HA, the 3’ homology arm was generated 
by PCR from genomic DNA using the following oligonucleotides which contain 
attB2 and attB3 sites: 5'-GGGGACAGCTTTCTTGTACAAAGTGGGCAGTCC
CAGCATGACCCAGCCCCAGCATGACCCAGCATCTGC-3’, and 5’-GGGG 
ACAACTTTGTATAATAAAGTTGCCGTCTCACTCAGTTCTGTGTGTG-3’. 
The attB2-attB3 PCR fragment was ligated into pDONR(P2-P3) plasmid 
(Invitrogen) by BP recombination reaction, followed by the LR recombination
reaction to generate the final targeting construct by using pENTR-BATF2-5HA,
pENTR-BATF2-3HA, pENTR-loxFRT rNEO, and pDEST DTA-MLS. The 
linearized vector was electroporated into EDJ22 embryonic stem cells, on the 
129 SvEv background, and targeted clones were identified by Southern blot 
analysis with 5’ and 3’ probes. Blastocyst injections were performed, and male 
chimaeras were bred to female 129S6/SvEv mice. Probes for Southern blot 
analysis were amplified from genomic DNA using the following primers: 
5'-GTTGGTGTGAGGTGATGAGGTCCC-3’ and 5'-CTTGACTTCCTAGAC 
CAGGGGC-3' for the 5’ probe; 5’-GGACCATATACTCTTACTGTTGAAA 
CC-3' and 5'-GGGCTGGGGACACACATG-3’ for the 3’ probe. Southern blot 
analysis and genotyping PCR were performed to confirm germline transmission 
and the genotype of the progeny. Primers used for genotyping PCR are shown as 
follows: knockout screen forward, 5'-GAAACTGAAACTGGGTGTGCTA 
GGG-3'; knockout screen reverse, 5’- CGCCTTCTTGACGAGTTCTTCTG 
AG-3'; wild-type screen reverse, 5’-GCCTTCCTTGTCTCTCTCCATAGCG-3’. 
Generation of BATF2-specific rabbit polyclonal antibody. Murine full-length 
Batf2 cDNA was cloned into the pET-28a (+) expression vector (Novagen) to 
generate recombinant BATF2 protein with a His tag. Recombinant BATF2 was 
then expressed using Escherichia coli BL21 (Invitrogen). Purified recombinant 
BATF2 was used to immunize New Zealand white rabbits (Harlan), and rabbit
anti-mouse BATF2 sera were collected and tested by ELISA and western blot 
analysis. 

Pathogen infections. Listeria monocytogenes and T. gondii infections were 
carried out as previously described. The type II avirulent Prugniaud (Pru)
strain of T. gondii expressing a firefly luciferase and GFP transgene (provided by 
J. Boothroyd) was used for infections of Batf2−/− mice for the experiments shown
in Fig. 4 and Supplementary Fig. 7. Briefly, 800 tachyzoites were injected 
intraperitoneally (i.p.) into mice. For herpes simplex virus 1 (HSV-1) infections, 
mice were infected with the KOS strain subcutaneously (s.c.) with 1.5 X 10° 
plaque-forming units (p.f.u.) per mouse in the footpad. Viral supernatant and 
HSV peptide were provided by M. Colonna. For M. tuberculosis infections, 
Erdman strain was grown to a density of 8 X 10° colony-forming units (c.f.u.) per 
ml, and mice were exposed to aerosol infection for 40 min using a Glas-Col 
aerosol exposure system. After 24h, two mice per group were killed, and lungs 
were collected to determine infection efficiency, which was about 100 c.f.u. per 
lung. Lungs and spleens were collected at 1, 3 and 9 weeks after infection. The left 
lobe of the lung and one-third of the spleen were used for histological examina- 
tion upon formalin fixation. The right lobe of the lung and two-thirds of the 
spleen were divided into c.f.u. analysis and FACS analysis. For c.f.u. analysis, 
organs were homogenized and plated at various dilutions on 7H-10 culture plates, 
and after 3 weeks, colonies were counted. For FACS analysis, organs were 
prepared as described. For lungs, Ficoll gradient purification was performed
before surface staining. All samples were fixed upon staining for 20 min in 4%
formalin. Serum was collected by retro-orbital bleeding at weeks 1, 2, 3 and 6 and
decontaminated by double filtering through a 0.2-µm filter.

Bioluminescence imaging. Imaging was done as previously described. In brief,
mice were injected intraperitoneally with D-luciferin (Biosynth AG) at 150 mg kg−1
and allowed to remain active for 5 min before being anaesthetized with 2%
isoflurane for 5 min. Animals were then imaged with a Xenogen IVIS200 machine
(Caliper Life Sciences). Data were analysed with the Living Image software (Caliper 
Life Sciences). 

In vitro T-cell re-stimulation after HSV infection. One week after infection, 
spleens were collected and re-stimulated with the HSV peptide HSV-gB2 (498-505) 
(Anaspec). Briefly, 2X 10° splenocytes were re-stimulated for 5 h in the presence of 
brefeldin A at 1 µg ml−1 and analysed by FACS for intracellular IFN-γ and TNF-α
production as described later. 

In vitro cross-presentation. Dendritic cell cross-presentation of antigen to 
CD8+ OT-I T cells was assessed as previously described. Briefly, spleens from
naive, vehicle or IL-12-treated wild-type or Batf3−/− mice were digested with
collagenase B (Roche) and DNase I (Sigma-Aldrich). CD11c+ dendritic cells were
obtained by negative selection using B220, Thy1.2, and DX5 microbeads followed 
by positive selection with CD11c microbeads by MACS purification (Miltenyi 
Biotec). CD8α+ and CD4+ cDCs were cell sorted on a FACSAria II flow
cytometer (post-sort purity >96%). Splenocytes from Kb−/− Db−/− β2m−/− mice
were prepared in serum-free medium, loaded with 10 mg ml−1 ovalbumin (EMD)
by osmotic shock, and irradiated (13.5 Gy) as described previously. OT-I
T cells were purified from OT-I/Rag2−/− mice by CD11c and DX5 negative
selection, followed by positive selection with CD8α microbeads (purity >96%).
T cells were fluorescently labelled by incubation with 1 µM CFSE (Sigma-Aldrich)
for 9 min at 25°C at a density of 2X 10” cells ml. For the cross-presentation 
assay, 5 X 10*-10° purified dendritic cells were incubated with 5 x 10*-10° 
CFSE-labelled OT-I T cells in the presence of increasing numbers of irradiated, 
ovalbumin-loaded Kb−/− Db−/− β2m−/− splenocytes (5 X 10° to 1 X 10°). After
3 days, OT-I T cell proliferation was analysed by CFSE dilution gating on
CD3+ CD8+ CD45.1+ cells.

Tumour transplantation. Methylcholanthrene-induced fibrosarcomas were
derived from 129SvEv strain Rag2−/− or wild-type mice as described
previously. Tumour cells were propagated in vitro and 10° tumour cells (129SvEv
fibrosarcoma tumour line H31m1 or D42m1) were injected s.c. in a volume of
150 µl endotoxin-free PBS into the shaved flanks of naive or IL-12-conditioned
(as described above) wild-type, Batf3−/− and Rag2−/− recipient mice as described
previously. Injected cells were >90% viable as assessed by trypan blue exclusion.
Tumour size was measured on the indicated days and is presented as the mean of 
two perpendicular diameters. 

Tumour harvest. Wild-type and naive or IL-12-treated Batf3−/− mice were
inoculated with the H31m1 fibrosarcoma tumour line. Eleven days later, tumours
were removed, minced and treated with 1 1g ml‘ type A collagenase (Sigma) in 
HBSS (Hyclone) for 1 h at 37 °C. Cell suspension was analysed by FACS for CD8 
T-cell recruitment at the tumour site gating on CD45+, Thy1.2+ and CD8α+
cells.

Quantitative RT-PCR. For gene expression analysis, RNA was prepared from 
various cell types with either RNeasy Mini kit (Qiagen) or RNeasy Micro kit 
(QIAGEN), and cDNA was synthesized with Superscript III reverse transcription 
(Invitrogen). Real-time PCR and a StepOnePlus real-time PCR system (Applied 
Biosystems) were used according to the manufacturer’s instructions, with the 
Quantitation Standard-Curve method and HotStart-IT SYBR Green qPCR 
Master Mix (Affymetrix/USB). PCR conditions were 10 min at 95 °C, followed 
by 40 two-step cycles consisting of 15s at 95 °C and 1 min at 60 °C. Primers used 
for measurement of Batf2 expression are as follows: Batf2 qRT forward, 
5'-GGCAGAAGCACACCAGTAAGG-3’; Batf2 qRT reverse, 5'-GAAGGGC 
GTGGTTCTGTTTC-3’. Hprt was used as normalization control, and primers 
used are as follows: Hprt forward, 5'-TCAGTCAACGGGGGACATAAA-3’; 
Hprt reverse, 5'-GGGGCTGTACTGCTTAACCAG-3’. 
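
The standard-curve quantitation named above can be illustrated with a short calculation: fit Ct against log10 of the template quantity for a dilution series, interpolate unknowns from the fitted line, and normalize the target (Batf2) to the control (Hprt). This is a generic sketch, not the instrument software's implementation; all Ct values and dilution quantities below are hypothetical placeholders.

```python
import math


def fit_standard_curve(quantities, cts):
    """Least-squares fit of Ct = slope * log10(quantity) + intercept."""
    xs = [math.log10(q) for q in quantities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(cts) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, cts))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def quantity_from_ct(ct, slope, intercept):
    """Interpolate a template quantity from a measured Ct."""
    return 10 ** ((ct - intercept) / slope)


def relative_expression(target_ct, target_curve, ref_ct, ref_curve):
    """Target quantity divided by reference quantity for one sample."""
    target = quantity_from_ct(target_ct, *target_curve)
    reference = quantity_from_ct(ref_ct, *ref_curve)
    return target / reference


# Hypothetical 10-fold dilution series (arbitrary units) and Ct values.
standards = [1.0, 10.0, 100.0, 1000.0]
batf2_curve = fit_standard_curve(standards, [30.0, 26.7, 23.3, 20.0])
hprt_curve = fit_standard_curve(standards, [28.0, 24.6, 21.3, 18.0])

# Hypothetical sample: Batf2 Ct of 25.0 normalized to Hprt Ct of 22.0.
print(relative_expression(25.0, batf2_curve, 22.0, hprt_curve))
```

A slope near −3.32 corresponds to a doubling of product per cycle, so the fitted slope also serves as a rough check on amplification efficiency.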

Dendritic cell preparation. Dendritic cells from lymphoid organs and non- 
lymphoid organs were collected and prepared as described. Briefly, spleens,
MLNs, SLNs (inguinal) and liver were minced and digested in 5 ml Iscove’s
modified Dulbecco’s media plus 10% FCS (cIMDM) with 250 µg ml−1 collagenase
B (Roche) and 30 U ml−1 DNase I (Sigma-Aldrich) for 30 min at 37 °C with
stirring. Cells were passed through a 70-µm strainer before red blood cells were
lysed with ACK lysis buffer. Cells were counted on a Vi-CELL analyser, and
5-10 X 10° cells were used per antibody staining reaction. Lung cell suspensions 
were prepared after perfusion with 10 ml Dulbecco’s PBS (DPBS) via injections 
into the right ventricle after transection of the lower aorta. Dissected and minced 
lungs were digested in 5 ml cIMDM with 4 mg ml−1 collagenase D (Roche) for 1 h
at 37 °C with stirring. Cells from the peritoneal cavity were collected by washing
the peritoneal cavity with 10 ml HBSS plus 2% FCS and 2 mM EDTA.
Bone-marrow-derived dendritic cells and macrophages. Bone-marrow-derived 
dendritic cells were generated from bone marrow with 100 ng ml−1 of recombinant
Flt3L as described. Bone-marrow-derived macrophages were generated from
bone marrow with 20 ng ml−1 of recombinant M-CSF (Peprotech) for 7 days,
and rested in cIMDM without M-CSF for 24 h before stimulation with various
conditions as described in the figure legends. 

Preparation of T. gondii lysate antigen (STAg). Toxoplasma gondii antigen was 
prepared by lysing tachyzoites of the Pru strain in foreskin fibroblast cultures 
through two cycles of freeze-thaw in liquid nitrogen. Lysate was then filtered, 
aliquoted at a concentration of 1 mg ml−1 in PBS and stored at −80 °C. For
dendritic cell stimulation, 0.05 µg ml−1 was used and intracellular IL-12
production was analysed by FACS.

Cytokine-induced CD8α+ dendritic cells. Wild-type and Batf3−/− mouse bone
marrow was prepared as described above. GM-CSF (10 ng ml−1) and/or IFN-γ
(0.1 ng ml−1) was added to the cultures between days 8 and 10. Cells were
analysed 2 days after cytokine addition.

Antibodies and flow cytometry. Staining was performed at 4 °C in the presence 
of Fc block (anti-CD16/32 clone 2.4G2, BioXCell) in FACS buffer (DPBS plus
0.5% BSA plus 2 mM EDTA). The antibodies used for dendritic cell analysis were
as recently described. Cells were analysed on BD FACSCanto II or FACSAria II
and analysed with FlowJo software (Tree Star, Inc.).

Intracellular cytokine staining. For intracellular cytokine staining, cells 
were surface stained, then fixed in 2% paraformaldehyde for 15 min at 4 °C,
permeabilized in DPBS plus 0.1% BSA plus 0.5% saponin, and stained for
intracellular cytokines as previously described. Additional antibodies included
anti-TNF-α (MP6-XT22) and anti-IFN-γ (XMG1.2) from BioLegend.

ELISA and CBA. IL-12p40 concentration was measured from serum samples 
with the mouse IL-12p40 OptEIA ELISA set (BD Bioscience) according to the 
manufacturer’s instructions. The concentration of inflammatory cytokines was 
measured in the serum with the BD CBA mouse inflammation kit (BD 
Biosciences), and data were analysed with FCAP Array software (Soft Flow, Inc.). 
Statistical analysis. Differences between groups in survival were analysed by the 
log-rank test. Analysis of all other data was done with an unpaired, two-tailed 
Student’s t-test with a 95% confidence interval (Prism; GraphPad Software, Inc.). 
P values less than 0.05 were considered significant. *0.01 < P < 0.05;
**0.001 < P < 0.01; ***P < 0.001; ****P < 0.0001.
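
The significance annotation and the unpaired Student's t statistic described above can be sketched with the Python standard library. This is an illustration, not the Prism procedure used in the study. Two choices here are assumptions: the "ns" label for non-significant values, and the reading of *** as 0.0001 ≤ P < 0.001 so that the bands do not overlap. Converting t to a P value needs the t distribution, which this sketch leaves out.

```python
import math
import statistics


def significance_stars(p: float) -> str:
    """Map a P value to the star notation listed in the text."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("P value must lie between 0 and 1")
    if p < 0.0001:
        return "****"
    if p < 0.001:
        return "***"
    if p < 0.01:
        return "**"
    if p < 0.05:
        return "*"
    return "ns"  # assumed label; the text only defines the starred bands


def students_t(group_a, group_b):
    """Unpaired, equal-variance Student's t statistic and degrees of freedom."""
    n_a, n_b = len(group_a), len(group_b)
    var_a = statistics.variance(group_a)  # sample variance (n - 1)
    var_b = statistics.variance(group_b)
    df = n_a + n_b - 2
    pooled = ((n_a - 1) * var_a + (n_b - 1) * var_b) / df
    se = math.sqrt(pooled * (1.0 / n_a + 1.0 / n_b))
    t = (statistics.mean(group_a) - statistics.mean(group_b)) / se
    return t, df


# Hypothetical percentages for two groups of five mice.
t_stat, dof = students_t([5.1, 4.8, 5.6, 5.3, 4.9], [2.2, 1.9, 2.5, 2.1, 1.8])
print(f"t = {t_stat:.2f} with {dof} degrees of freedom")
print(significance_stars(0.003))
```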

Expression microarray analysis. Total RNA was isolated from cells using the 
Ambion RNAqueous-Micro kit. For mouse genome 430 2.0 arrays, RNA was 
amplified, labelled, fragmented, and hybridized using the 3’ IVT Express kit 
(Affymetrix). Data were normalized and expression values were modelled using 
DNA-Chip analyser (dChip) software (http://www.dChip.org). For mouse gene
1.0 ST arrays, RNA was amplified with the WT expression kit (Ambion) and 
labelled, fragmented and hybridized with the WT terminal labelling and 
hybridization kit (Affymetrix). Data were processed using RMA quantile 
normalization and expression values were modelled using ArrayStar software 
(DNASTAR). All original microarray data have been deposited in NCBI’s 
Gene Expression Omnibus under accession number GSE40647. 

CD4+ T-cell cultures. CD4+ T cells were purified using the DynaBeads
FlowComp mouse CD4 kit (Invitrogen) and were activated on anti-CD3/anti-
CD28 coated plates under the following culture conditions: TH1 conditions:
anti-IL-4 10 µg ml−1 (11B11, BioXCell), IFN-γ 0.1 µg ml−1 (Peprotech), IL-12
10 U ml−1, IL-2 40 U ml−1; TH2 conditions: anti-IL-12 10 µg ml−1 (Tosh,
BioXCell), anti-IFN-γ 10 µg ml−1 (XMG1.2, BioXCell), IL-4 10 ng ml−1
(Peprotech), IL-2 40 U ml−1; TH17 conditions: IL-6 25 ng ml−1 (Peprotech),
TGF-β 2 ng ml−1 (Peprotech), IL-1β 10 ng ml−1 (Peprotech), anti-IL-4 10 ng
ml−1 (11B11, BioXCell), anti-IFN-γ 10 µg ml−1 (XMG1.2, BioXCell), and anti-
IL-12 10 µg ml−1 (Tosh, BioXCell). IL-2 (40 U ml−1) was added for the TH17
cultures in Supplementary Fig. 12a. Cells were diluted three-fold in fresh media
on day 3. On day 5, cells were activated with PMA/ionomycin for analysis of 
cytokines and surface markers by FACS. For some experiments, cells were 
re-stimulated on day 7 under the same conditions and analysed 5 days later. 
Retroviral analysis of BATF functional activity. For CD8α+ dendritic cell
development, bone marrow was cultured with Flt3L as described, infected with
retrovirus and 2 µg ml−1 polybrene on day 1, and analysed for development of
cDCs (CD11c+ B220−) on day 10.

CD4+ T cells were cultured under TH17 conditions and infected with
retrovirus and 6 µg ml−1 polybrene on day 1. Cells were analysed on day 5 for
cytokine expression by intracellular staining. For some experiments, cells were 
re-stimulated on day 7 under the same conditions and analysed 5 days later. 

For analysis of class switch recombination, B220+ cells purified by anti-B220 MACS
microbeads (Miltenyi) were cultured with LPS and IL-4 as described (refs 45, 47), infected
with retrovirus and 6 µg ml−1 polybrene on day 1, and analysed for switching to
IgG1 on day 4 by FACS.

For infection of primary mouse cells, GFP-RV constructs containing cDNAs for 
transcription factors, chimaeric proteins and mutated BATF were transfected 
into Phoenix E cells as described previously. Viral supernatants were collected 
2 days later and concentrated by centrifugation. Cells were infected with viral 
supernatants by spin infection at 1,800 r.p.m. for 45 min at room temperature. 

Electromobility shift assays (EMSAs). For stable expression in 293FT cells, 
retroviruses (GFP-RV containing BATF, chimaeric proteins or BATF mutants, 
and/or truncated hCD4-RV containing IRF4 or IRF8 cDNA) were packaged in 
Phoenix A cells and concentrated by centrifugation. Infected 293FT cells were 
sorted for retroviral marker expression and, when indicated, were transiently 
transfected with JUNB-GFP-RV using calcium phosphate. For transient 
expression, 293FT cells were transfected with retroviral constructs using calcium 
phosphate. Nuclear extracts were prepared 48 h after transfection. For 293FT 
cells, nuclei were obtained after cellular lysis with buffer A containing 0.2% 
NP40, and nuclear extracts in buffer C were dialysed against buffer D as 
described. Nuclear extracts from T and B cells were prepared as described. 

EMSA was performed as described using 3 μg of nuclear extract, 1 mg poly(dIdC) 
(Sigma) and 7%T 3.3%C polyacrylamide gels. For competition assays, extracts were 
incubated with excess unlabelled competitor DNA for 20 min before addition of 
labelled probe. For supershifts, extracts were incubated with antibody for 1 h on 
ice before addition of labelled probe and incubation at room temperature for 
35 min. The antibodies were goat anti-FOS (4) (Santa Cruz) for 293FT cells or 
anti-FOS (2G9C3) (Abcam) for mouse T cells, anti-IRF4 (H-140) (Santa Cruz), 
anti-ICSBP (C-19) (Santa Cruz), anti-JUNB (C-11) (Santa Cruz), rabbit anti-BATF, 
rabbit anti-BATF2 (described above) and rabbit anti-BATF3 (W. KC et al., 
manuscript submitted). 

The following pairs of oligonucleotides were annealed to generate probes which 
were labelled with ³²P-dCTP using Klenow polymerase. The AP1 consensus binding 
site is underlined; the IRF consensus binding site is in bold; mutated bases are in 
lower case. 

AP1: 5'-GATCAGCTTCGCTTGATGAGTCAGCCGG-3' / 
5'-GATCCGGCTGACTCATCAAGCGAAG-3' 
AICE1 (from third intron of mouse Ctla4): 
5'-CTTGCCTTAGAGGTTTCGGGATGACTAATACTGTA-3' / 
5'-TCACGTACAGTATTAGTCATCCCGAAACCTCTAAGG-3' 
AICE1 M1 (mutates the IRF consensus binding site): 
5'-CTTGCCTTAGAGGccaCGGGATGACTAATACTGTA-3' / 
5'-TCACGTACAGTATTAGTCATCCCGtggCCTCTAAGG-3' 
AICE1 M2 (mutates the AP1 consensus binding site): 
5'-CTTGCCTTAGAGGTTTCGGGAgacCTAATACTGTA-3' / 
5'-TCACGTACAGTATTAGgtcTCCCGAAACCTCTAAGG-3' 
AICE2 (from third intron of Il23r): 
5'-GATGTTTCAGGGAAAGCACTGACTCACTGGCTCTCCA-3' / 
5'-GGTGGAGAGCCAGTGAGTCAGTGCTTTCCCTGAAA-3' 
EICE: 5'-GAAAAAGAGAAATAAAAGGAAGTGAAACCAAG-3' / 
5'-GATCCTTGGTTTCACTTCCTTTTATTTCTCTTT-3' 
Eα: 5'-TCGACATTTTTCTGATTGGTTAAA-3' / 
5'-GACTTTTAACCAATCAGAAAAATG-3' 
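
As a sanity check on a transcribed probe pair, the two strands can be tested for complementarity. The sketch below is illustrative only and is not part of the published protocol; the helper names `reverse_complement` and `duplex_region` are hypothetical. It finds the longest stretch of the top strand that base-pairs with the bottom strand, using the AP1 pair listed above. Any unpaired 5' ends would be consistent with labelling by Klenow fill-in, although the text does not state this explicitly.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (case is ignored)."""
    pairs = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(pairs[base] for base in reversed(seq.upper()))


def duplex_region(top: str, bottom: str) -> str:
    """Longest stretch of `top` that base-pairs with `bottom` when annealed."""
    top = top.upper()
    partner = reverse_complement(bottom)
    best = ""
    # Dynamic programme for the longest common substring of top and partner.
    previous = [0] * (len(partner) + 1)
    for i in range(1, len(top) + 1):
        current = [0] * (len(partner) + 1)
        for j in range(1, len(partner) + 1):
            if top[i - 1] == partner[j - 1]:
                current[j] = previous[j - 1] + 1
                if current[j] > len(best):
                    best = top[i - current[j]:i]
        previous = current
    return best


# AP1 probe pair as listed in the text.
ap1_top = "GATCAGCTTCGCTTGATGAGTCAGCCGG"
ap1_bottom = "GATCCGGCTGACTCATCAAGCGAAG"

duplex = duplex_region(ap1_top, ap1_bottom)
print(duplex, len(duplex))  # CTTCGCTTGATGAGTCAGCCGG 22
```

The same function accepts the mutant probes, because lower-case (mutated) bases are upper-cased before comparison.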

Plasmids. cDNAs for Batf, Batf2, Batf3, Irf8 and JunB were generated by PCR 
from primary cells. MigR1 containing HA-tagged IRF4 was from H. Singh 
(Genentech). HA-IRF4 and IRF8 cDNAs were subcloned into hCD4-RV. 
Mutations in Batf were generated using QuikChange mutagenesis with 
BATF-GFP-RV as the template. cDNAs for chimaeric proteins were generated by 
overlap extension and PCR and cloned into GFP-RV. 
Structure of the agonist-bound 
neurotensin receptor 


Jim F. White, Nicholas Noinaj, Yoko Shibata, James Love, Brian Kloss, Feng Xu, Jelena Gvozdenovic-Jeremic, 
Priyanka Shah, Joseph Shiloach, Christopher G. Tate & Reinhard Grisshammer 


Neurotensin (NTS) is a 13-amino-acid peptide that functions as both a neurotransmitter and a hormone through the 
activation of the neurotensin receptor NTSR1, a G-protein-coupled receptor (GPCR). In the brain, NTS modulates the 
activity of dopaminergic systems, opioid-independent analgesia, and the inhibition of food intake; in the gut, NTS 
regulates a range of digestive processes. Here we present the structure at 2.8 Å resolution of Rattus norvegicus NTSR1 
in an active-like state, bound to NTS₈₋₁₃, the carboxy-terminal portion of NTS responsible for agonist-induced 
activation of the receptor. The peptide agonist binds to NTSR1 in an extended conformation nearly perpendicular to 
the membrane plane, with the C terminus oriented towards the receptor core. Our findings provide, to our knowledge, 
the first insight into the binding mode of a peptide agonist to a GPCR and may support the development of non-peptide 
ligands that could be useful in the treatment of neurological disorders, cancer and obesity. 


Neurotensin (NTS) is a short peptide that is found in the nervous 
system and in peripheral tissues. NTS shows a wide range of biological 
activities and has important roles in Parkinson’s disease and the 
pathogenesis of schizophrenia, the modulation of dopamine 
neurotransmission, hypothermia, antinociception, and in promoting the growth of 
cancer cells. Three neurotensin receptors have been identified. 
NTSR1 (ref. 7) and NTSR2 (ref. 8) belong to the class A GPCR family, 
whereas NTSR3 (also called SORT1) is a member of the sortilin family 
with a single transmembrane domain. Most of the known effects of 
NTS are mediated through NTSR1 (ref. 5), which signals preferentially 
via the Gq protein. Many aspects of ligand binding to NTSR1 have been 
addressed by mutagenesis and modelling, but the details of ligand 
binding remain poorly understood at the molecular level. This has 
limited the rational design of compounds targeting NTSR1 for 
therapeutic purposes in the absence of structural information on NTSR1 in 
complex with agonist or antagonist. 

Recent progress in elucidating GPCR structures in complex with 
small-molecule antagonists, agonists or a heterotrimeric G protein 
has provided detailed molecular insights 
into GPCR function (reviewed in refs 13–15). However, most receptor 
structures are of GPCRs belonging to the α group of class A (ref. 16), 
which characteristically bind small endogenous agonists deep within 
the transmembrane core. NTSR1 belongs to a large family of peptide 
receptors in the β group of class A GPCRs (Supplementary Fig. 1) that 
includes receptors for substance P, cholecystokinin, neuropeptide Y, 
endothelin and oxytocin. These peptide receptors bind agonists of a 
wide range of sizes, from a few amino acid residues to over 60. Current 
structural information on peptide receptors is limited to 
representatives of the γ group of class A GPCRs, namely the opioid receptors 
and a chemokine receptor, all bound to non-peptidic drug-like 
antagonists. The CXCR4 receptor has also been crystallized in the presence 
of a cyclic peptide antagonist. Although the structural understanding 
of peptide receptors has advanced, structures of peptide GPCRs from 
the β group have not yet been reported. In addition, no structures have 
been presented of a GPCR bound to a peptide agonist, and only the 
structures of rhodopsin and the adenosine A₂A receptor (A₂AR) 
have been determined bound to endogenous agonists. Here we present 
the crystal structure of Rattus norvegicus NTSR1 from the β GPCR 
subfamily bound to the C-terminal portion (NTS₈₋₁₃) of the 
endogenous agonist NTS. 


Structure determination of the agonist-bound NTSR1 


Wild-type NTSR1 is unstable in detergent solution and hence is a 
difficult target for structural studies, particularly in the agonist-bound 
state, which is generally thought to be less stable than the 
antagonist-bound state. The first step was therefore to stabilize NTSR1 using the 
strategy of conformational thermostabilization. Previous work had 
generated a thermostable mutant of NTSR1, but the purified receptor 
was found to be unsuitable for crystallization, possibly because the 
receptor contained mutations that were selected in the unliganded 
state. NTSR1 was therefore thermostabilized in the presence of the 
agonist NTS through the inclusion of six stabilizing mutations 
(mutant NTSR1-GW5; A86L¹.⁵⁴, E166A³.⁴⁹, G215Aᴱᶜᴸ², L310A⁶.³⁷, 
F358A⁷.⁴² and V360A⁷.⁴⁴, where superscripts are the 
Ballesteros–Weinstein numbers) (Supplementary Fig. 2 and Supplementary 
Table 1). Pharmacological characterization of NTSR1-GW5 
(Supplementary Fig. 3, Supplementary Table 2 and Supplementary 
Discussion) showed that the affinity for the agonist NTS was similar to 
that of the wild-type receptor, but that the apparent affinity for the 
neutral antagonist SR48692 and the sensitivity to Na⁺ ions were lower. 
In G protein coupling assays, NTSR1-GW5 did not catalyse nucleotide 
exchange at Gαq in response to NTS (Supplementary Fig. 3), indicating 
that the stabilizing mutations may have limited the ability to propagate 
the agonist-induced conformational change at the ligand-binding 
pocket to the re-arrangement of the intracellular face of the receptor 
required for G protein binding and activation. 

For crystallization of NTSR1-GW5, T4 lysozyme (T4L) was 
engineered into intracellular loop 3 (ICL3) to improve the probability of 
obtaining well-diffracting crystals. NTSR1-GW5-T4L was expressed 
in insect cells using the baculovirus expression system, solubilized and 
purified in lauryl maltose neopentyl glycol and crystallized using the 
lipidic cubic phase methodology. Diffraction data were collected to 
2.8 Å resolution from a single crystal and used to determine the structure 
by molecular replacement (Supplementary Table 3, Supplementary 
Fig. 4 and Supplementary Fig. 5). The final model of NTSR1-GW5-T4L, 
refined to R/Rfree values of 0.23/0.28, includes 454 residues, 2 
HEPES molecules, 23 H₂O molecules and the agonist ligand NTS₈₋₁₃. 


Overall structure of NTSR1-GW5-T4L 


The receptor core has seven transmembrane helices (TM1–TM7) 
(Fig. 1 and Supplementary Fig. 6), as expected, but there was no 
density corresponding to amphipathic helix 8 at the C terminus of 
the receptor. Helix 8 has been observed in all GPCR structures except 
for the chemokine receptor CXCR4 (ref. 21), so the absence of helix 8 
in the structure of NTSR1-GW5-T4L could mean either that NTSR1 
does not possess helix 8 in vivo or that its absence is a crystallization 
artefact. On the intracellular side, TM7 is elongated by three helical 
turns beyond the conserved NP⁷.⁵⁰XXY⁷.⁵³ motif, whereas other 
GPCRs have only one helical turn. This extension of TM7 is associated 
with the intracellular end of TM6 through hydrogen bonds between 
the side-chain atoms of N375 and Q378 and the backbone carbonyl 
oxygen of A302⁶.²⁹ (Supplementary Fig. 7). 

Of the intracellular loops in NTSR1, ICL1 was disordered (K93–Q96), 
ICL3 was disrupted through insertion of T4L, and only ICL2 
showed density for a structural element, a short π-helix (Fig. 1). ICL2 
is thought to be important for G protein coupling, and the short helix 
in NTSR1 is similar to those observed in identical positions in both 
the inactive state and in active states of β-adrenergic receptors 
(βARs). In the β₁-adrenergic receptor (β₁AR), the helix in ICL2 is 
stabilized by a tyrosine residue in the middle of the 
helix that forms a hydrogen bond with the aspartate residue of 
the conserved D/ER³.⁵⁰Y sequence in TM3. In NTSR1, K178 in the 
middle of the π-helix in ICL2 may perform a similar function, 
although its side chain is disordered, probably because the stabilizing 
mutation E166A³.⁴⁹ prevents the formation of the salt bridge. The 
extracellular surface of NTSR1 is composed of three short extracellular 
loops (ECL1, ECL2 and ECL3) and the N-terminal residues N52–D60 
that extend across the receptor to form one side of the ligand-binding 
pocket (see below). The longest loop is ECL2, which contains two short 
antiparallel β-strands that are linked to TM3 by a disulphide bond 
between the conserved residues C142³.²⁵ and C225ᴱᶜᴸ². These 
β-strands are characteristic of all the peptide GPCR structures 
determined so far. 

Figure 1 | Overview of the NTSR1 structure bound to the peptide agonist 
NTS₈₋₁₃. a–c, Cartoon representation of NTSR1-GW5-T4L; side view 
(a), extracellular view (b), intracellular view (c). Space-filling models are used to 
depict the agonist NTS₈₋₁₃ (orange), the side chains of thermostabilizing 
mutations (purple) and the disulphide bond (yellow) between the conserved 
residues C142 and C225. Also shown are the β-hairpin in extracellular loop 2 
(blue-green) and the π-helix in intracellular loop 2 (ICL2). T4L has been 
omitted from the intracellular view for clarity. 

NTSR1-GW5-T4L contains six thermostabilizing mutations and 
none of these residues is in the NTS binding pocket (Fig. 1). Three of 
these mutations may directly affect the equilibrium between inactive 
and active states of NTSR1. The mutation F358A⁷.⁴² is located at the 
kink in TM7, with the alanine side chain pointing into the receptor core, 
and the F358A mutant is known to be constitutively active. The 
mutation E166A³.⁴⁹ is in the conserved D/ER³.⁵⁰Y motif in TM3 and will 
prevent the formation of the intrahelical salt bridge with R167³.⁵⁰, thus 
facilitating the transition to the active state by allowing R167³.⁵⁰ to 
interact with N257⁵.⁵⁸. The R167–N257 interaction may be promoted 
by the L310A⁶.³⁷ mutation through decreasing the size of the side chain. 
The reasons for the thermostabilizing effect of the A86L¹.⁵⁴, V360A⁷.⁴⁴ 
and G215Aᴱᶜᴸ² mutations are not obvious, although it is interesting 
that an alanine residue is found in human NTSR1 (Supplementary 
Fig. 8) at the position equivalent to G215 in rat NTSR1. 


NTSR1-GW5 is in an active-like conformation 

Upon agonist binding, a GPCR undergoes conformational changes 
that ultimately allow binding and activation of a G protein at the 
intracellular surface of the receptor. Agonist-induced conformational 
changes in the ligand-binding pocket are largely receptor specific and 
thought to be relatively small, leading to larger conformational changes 
in the intracellular half of the receptor. There are no reported 
structures of NTSR1 in an inactive conformation, so we are unable to discuss 
agonist-specific conformational changes in the ligand-binding pocket. 
However, it is possible to compare the NTSR1 structure to the inactive 
and active conformations of rhodopsin and β₂AR. This allows an 
assessment of the overall conformation of NTSR1 and, in addition, 
the rotamer conformations of mechanistically important residues that 
show characteristic interactions depending on whether the receptor is 
in an inactive or active state. These comparisons indicate that NTSR1 is 
in an active-like state. 

One of the major conformational changes occurring during the 
transition from the inactive to an active state is an outward tilt of 
TM6 at the intracellular face of the GPCR. The extent of movement 
of TM6 upon activation is 6 Å for rhodopsin and 11–14 Å for β₂AR, 
with TM6 of NTSR1 in a position similar to that seen for the active states 
of rhodopsin (Fig. 2). The distance between the Cα atoms of R³.⁵⁰ 
and E⁶.³⁰ changes upon activation from 9 Å to 15 Å in rhodopsin and 
from 11 Å to 19 Å in β₂AR. The equivalent distance in NTSR1 is 
14 Å. 

Other features of NTSR1 are also remarkably similar to those found 
in the active conformations of rhodopsin and β₂AR. In inactive 
receptor conformations, R³.⁵⁰ of the D/ERY motif forms an intrahelical salt 
bridge with D/E³.⁴⁹, which is, for example, present in the inactive 
structure of the nociceptin/orphanin FQ receptor (NOP) (Fig. 3), 
which is the closest structural homologue of NTSR1 among peptide 
receptors. Upon activation, the D/E³.⁴⁹–R³.⁵⁰ salt bridge is broken and 
R³.⁵⁰ re-orients to form an interhelical hydrogen bond with the 
conserved tyrosine residue Y⁵.⁵⁸, linking TM3 and TM5 (ref. 13). In 
NTSR1, the R167³.⁵⁰ side chain is hydrogen bonded to N257⁵.⁵⁸, which 
interacts with the conserved residue S164³.⁴⁷ (Fig. 3), stabilizing 
TM5 in an active-like orientation. In addition, Y369⁷.⁵³ of the highly 
conserved NP⁷.⁵⁰XXY⁷.⁵³ motif occupies space within the helical bundle 
as seen for Y⁷.⁵³ in the active states of rhodopsin and β₂AR (Fig. 3). In 
contrast to the active rhodopsin and β₂AR, Y369⁷.⁵³ in NTSR1 does not 
pack against R167³.⁵⁰, suggesting that N257⁵.⁵⁸ has a greater 
contribution to the active state stabilization than Y369⁷.⁵³ (ref. 35). 

Comparison of the NTSR1 structure with rhodopsin and β₂AR 
suggests that it is in an active-like state, indicating that it may couple 
effectively to G proteins. NTSR1-GW5-T4L, as expected, does not 
couple to G protein (Supplementary Fig. 3), because T4L replaces 
most of ICL3, which is essential for G protein binding, and sterically 
blocks access to the intracellular surface of the receptor. Surprisingly, 
NTSR1-GW5, which contains the wild-type ICL3, also does not catalyse 
nucleotide exchange at Gαq in response to NTS (Supplementary 
Fig. 3). This may be a consequence of the extended region of TM7 
on the intracellular surface forming an interhelical contact with TM6 
(Supplementary Fig. 7). This appears to occlude the cavity at the 
intracellular face of the receptor where the C-terminal portion of 
the Gα protein binds to fully active receptors. Thus, although 
the structure of NTSR1 contains many features of an active receptor, it 
is possible that the intracellular surface of the receptor does not 
represent the structure of fully active NTSR1. 

Figure 2 | NTSR1-GW5-T4L is in an active-like conformation. a, Alignment 
of NTSR1-GW5 (green) with the inactive state of the nociceptin receptor 
(NOP) (red; Protein Data Bank (PDB) code 4EA3). NTSR1 was most similar 
to NOP (root mean squared deviation = 2.1 Å for Cα atoms in the TM 
domains) upon alignment of the TM domains of NTSR1 with other peptide 
receptors. b, Alignment of the inactive state of rhodopsin (light brown, 
PDB code 1GZM), its active state meta-II (ref. 22) (blue-green, PDB code 
3PQR) and NTSR1-GW5 (green). c, Alignment of the inactive state of β₂AR 
(pale mauve, PDB code 2RH1) and its active state (pale grey, PDB code 3SN6) 
and NTSR1-GW5 (green). All views are from the intracellular surface. The 
arrows indicate the apparent displacement of TM5 and TM6 of NTSR1-GW5 
relative to the corresponding helix positions in the inactive receptor structures. 

Figure 3 | The conserved D/ERY and NPXXY motifs in NTSR1-GW5. 
a, Comparison of the side-chain orientations of R³.⁵⁰ and Y/N⁵.⁵⁸ of 
NTSR1-GW5 and the inactive nociceptin receptor (NOP) (PDB code 4EA3). 
NTSR1-GW5 residues (R167, N257, labelled) are in green; corresponding NOP residues 
(R148, Y235, unlabelled) are in red. D147³.⁴⁹ of NOP and A166³.⁴⁹ of NTSR1 
are also shown. b, In an inactive GPCR, R³.⁵⁰ interacts with E³.⁴⁹ as shown here 
for rhodopsin (pale brown, PDB code 1GZM). This salt bridge is broken upon 
activation, allowing R³.⁵⁰ to contact residue Y/N⁵.⁵⁸, as depicted for active 
rhodopsin (ref. 22) (blue-green, PDB code 3PQR) (residues E134, R135 and 
Y223). c, The same comparison for inactive β₂AR (pale mauve, PDB code 
2RH1) and active β₂AR (pale grey, PDB code 3SN6) (residues E130, R131 and 
Y219). d, Y369⁷.⁵³ of NTSR1 (green) occupies space as seen for Y306⁷.⁵³ in 
active rhodopsin (blue-green; PDB code 3PQR). The orientation of Y306⁷.⁵³ in 
the inactive rhodopsin (pale brown; PDB code 1GZM) and the inactive NOP 
(red; PDB code 4EA3) is shown for comparison. 


The NTS₈₋₁₃ binding site 

NTSR1-GW5-T4L was co-crystallized with NTS₈₋₁₃, which has higher 
potency and efficacy than full-length NTS, although it does not 
exist in vivo. NTSR1 also binds the NTS-related hexapeptide 
neuromedin N, which is produced along with NTS during proteolytic processing 
of the proNTS precursor. The amino acid sequence of neuromedin N 
(KIPYIL) is similar to NTS₈₋₁₃ (RRPYIL). Potent NTS analogues 
that can cross the blood–brain barrier (for example, derivatives of 
RKPWLL) can also be synthesized (see ref. 5). The similarity of these 
agonists to NTS₈₋₁₃ indicates that they all bind in a similar manner. 
There is charge complementarity between NTS₈₋₁₃ and its binding 
pocket, with the positively charged arginine side chains of NTS₈₋₁₃ 
adjacent to the electronegative rim of the binding site, and the C 
terminus of NTS₈₋₁₃ is predicted to form a salt bridge with R327⁶.⁵⁴ 
(Fig. 4 and Supplementary Table 4). There are also extensive van der 
Waals interactions between NTS₈₋₁₃ and the receptor along with 
hydrogen bonds and a salt bridge (Supplementary Table 4, 
Supplementary Table 5 and Supplementary Fig. 9). It is striking that only 
three out of eight hydrogen bonds are made between the side chains of 
NTS₈₋₁₃ and the receptor, with the bulk of receptor–ligand contacts 
being van der Waals interactions. 

The ligand-binding pocket for NTS₈₋₁₃ is composed of amino acid 
residues in the N terminus, the three extracellular loops (ECL1–ECL3) 
and six transmembrane α-helices (TM2–TM7), with the greatest 
number of ligand–receptor contacts from residues in ECL2, ECL3, 
TM6 and TM7 (Supplementary Table 5). The binding pocket is wide 
open on the extracellular surface of NTSR1 (Fig. 4 and Supplementary 
Fig. 10) with density for NTS₈₋₁₃ clearly discernible from a difference 
electron density map (Supplementary Fig. 11), but we believe that 
radiation damage during data collection has resulted in reduced density 
for the C-terminal carboxylate group (Supplementary Fig. 12). The 
extended conformation of NTS₈₋₁₃ in the crystal structure is in good 
agreement with that of NTS₈₋₁₃ bound to wild-type NTSR1 analysed 
by solid-state NMR (Supplementary Fig. 13). The overall shape of 
the NTSR1 ligand-binding pocket is narrower than that observed in 
the other peptide receptors due to an inward shift of the 
extracellular regions of TM2 and TM6 (Supplementary Fig. 14), probably as 
a result of the pronounced kink in TM2 at A120².⁵⁷ and a change in tilt 
of TM6. At the extracellular surface, the NTS₈₋₁₃ binding pocket is 
partially capped by the ECL2 β-hairpin and the proximal end of the 
receptor N terminus. NTSR1 has been subjected to extensive 
site-directed mutagenesis studies to define ligand–receptor interactions 
(see Supplementary Discussion) and there is excellent agreement 
between those data and the structure of NTSR1. 

Figure 4 | The NTSR1 agonist binding pocket. All views are from the 
extracellular side, with NTS₈₋₁₃ shown as a stick model. a, Cartoon 
representation of the ligand-binding pocket with the ECL2 β-hairpin (pink). 
The quality of the electron density of NTS₈₋₁₃ is depicted as blue isosurface 
(2Fo − Fc map contoured at 1σ). b, Key NTSR1 residues (green residues with 
grey labels) in contact with the peptide ligand (grey residues with bold black 
labels). Hydrogen bonds and salt bridges are indicated by dashed lines (black). 
The electron density maps of selected NTSR1 residues (L55, Y146, T226, R327) 
are shown as blue isosurface (2Fo − Fc map contoured at 1σ). c, The charge 
complementarity between NTS₈₋₁₃ and its binding pocket is depicted with the 
NTSR1 surface coloured according to its electrostatic potential (−4 kT e⁻¹ to 
+4 kT e⁻¹; red, negative; blue, positive). The positively charged arginine side 
chains of the ligand face the electronegative rim of the binding pocket, whereas 
the negatively charged carboxylate of L13 resides in an electropositive 
environment. See also Supplementary Information. 

Figure 5 | A new paradigm for peptide agonist binding. Compared to small 
endogenous agonists, the NTS₈₋₁₃ binding cavity is located near the receptor 
surface. a, The structures of the agonist-bound adenosine A₂A receptor (PDB 
code 2YDO), β₁AR (PDB code 2Y03) and meta-rhodopsin II (ref. 22) (PDB 
code 3PQR) were aligned in PyMOL. Only the cartoon representation for 
NTSR1-GW5 is shown (pale green) with NTS₈₋₁₃ depicted in purple. 

There is a striking difference between the binding mode of NTS₈₋₁₃ 
in NTSR1 and the binding of agonists in rhodopsin, β₁AR and A₂AR 
(Fig. 5). The agonist-specific interactions made by isoprenaline in 
β₁AR (ref. 43) and adenosine in A₂AR (ref. 24) all occur at a similar 
depth in the receptor with respect to the extracellular surface as retinal 
in rhodopsin. In contrast, NTS₈₋₁₃ does not penetrate the receptor 
as deeply, with the C terminus of NTS₈₋₁₃ over 5 Å closer to the 
extracellular surface than the hydroxyl groups in isoprenaline and 
adenosine that are characteristic of agonists for βARs and A₂AR. 
This indicates that the mode of activation of NTSR1 is subtly different 
from these receptors. Structures of NTSR1 in the inactive state will be 
necessary to gain a better understanding of how peptide agonists 
activate GPCRs. 


METHODS SUMMARY 


The stabilized NTSR1 mutant with T4 lysozyme replacing most of intracellular 
loop 3 was expressed in insect cells and purified by cobalt affinity 
chromatography. It was crystallized in lipidic cubic phase. Diffraction data were collected at 
the GM/CA-CAT beamline of the Advanced Photon Source at the Argonne 
National Laboratory. The structure was solved by molecular replacement using 
data from a single crystal. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 

Abbreviations. AEBSF, 4-(2-aminoethyl)benzenesulphonyl fluoride hydrochloride; 
CHAPS, 3-[(3-cholamidopropyl)dimethylammonio]propanesulphonic acid; CHS, 
cholesteryl hemisuccinate Tris salt; DDM, n-dodecyl-β-D-maltopyranoside; 
DM, n-decyl-β-D-maltopyranoside; ICL, intracellular loop; LMNG, lauryl 
maltose neopentyl glycol (2,2-didecylpropane-1,3-bis-β-D-maltopyranoside); T4L, 
cysteine-free bacteriophage T4 lysozyme (C54T, C97A); TCEP, 
Tris(2-carboxyethyl)phosphine hydrochloride. 

Materials. The tritiated agonist [³H]NTS ([3,11-tyrosyl-3,5-³H(N)]-pyroGlu-
Leu-Tyr-Glu-Asn-Lys-Pro-Arg-Arg-Pro-Tyr-Ile-Leu) was purchased from 
Perkin Elmer. Unlabelled NTS and NTS₈₋₁₃ (Arg-Arg-Pro-Tyr-Ile-Leu) were 
synthesized by the Center for Biologics Evaluation and Research (Food and 
Drug Administration) or purchased from AnaSpec. The detergents DDM, DM, 
LMNG, CHAPS and CHS were obtained from Anatrace. TCEP was also from 
Anatrace. Monoolein (M-7765), cholesterol (C-75209), lysozyme (L-6876) and 
DNase I (D-4527) were purchased from Sigma. Talon resin was from Clontech. 
Bio-Spin 30 Tris columns were obtained from BioRad. PD10 columns were 
purchased from GE Healthcare. 

Expression of NTSR1 in insect cells. The baculovirus construct NTSR1-GW5-T4L 
consisted of the haemagglutinin signal peptide and the Flag tag, followed by 
the thermostabilized rat NTSR1 (T43–K396 containing the mutations A86L, 
E166A, G215A, L310A, F358A, V360A) with the ICL3 residues H269–R299 
replaced by the cysteine-free bacteriophage T4 lysozyme (N2–Y161 with the 
mutations C54T and C97A) and a GSGS linker. A deca-histidine tag was placed 
at the C terminus. NTSR1-GW5 contained the wild-type ICL3 sequence. 
Wild-type NTSR1 (Met-T43–Y424) started at T43 but did not contain any other 
modifications. Recombinant baculoviruses were generated using a modified pFastBac1 
transfer plasmid (Invitrogen). Trichoplusia ni cells were infected at a cell density of 
0.8–1 million cells per ml with recombinant virus at a multiplicity of infection 
(MOI) of 5, and the temperature was lowered from 28 °C to 21 °C. Cells were 
collected by centrifugation 48 h after infection, re-suspended in hypotonic buffer 
(10 mM HEPES pH 7.5, 10 mM MgCl₂, 20 mM KCl), flash-frozen in liquid 
nitrogen and stored at −80 °C until use. 
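
The infection step above specifies a cell density and an MOI but not a virus titre. As a rough illustration of how the volume of virus stock follows from those quantities, here is a minimal sketch. The culture volume and the titre are assumed values chosen for the example, not figures from the paper, and `virus_volume_ml` is a hypothetical helper.

```python
def virus_volume_ml(cells_per_ml, culture_volume_ml, moi, titer_pfu_per_ml):
    """Volume of virus stock giving the requested multiplicity of infection.

    MOI is infectious units per cell, so the stock volume is
    (MOI x total cells) / titre.
    """
    total_cells = cells_per_ml * culture_volume_ml
    return moi * total_cells / titer_pfu_per_ml


# 1 million cells per ml and MOI 5 come from the text.
# The 1 litre culture and the 1e8 pfu/ml titre are assumptions for illustration.
volume = virus_volume_ml(
    cells_per_ml=1.0e6,
    culture_volume_ml=1000.0,
    moi=5,
    titer_pfu_per_ml=1.0e8,
)
print(f"{volume:.1f} ml of virus stock")  # 50.0 ml of virus stock
```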

Preparation of urea-washed P2 insect cell membranes. NTSR1-enriched 
membranes were obtained as a P2 fraction from insect cells. Prior to G protein 
coupling assays and agonist binding experiments, the P2 membranes were treated 
with urea to remove peripherally bound membrane proteins. The receptor 
density in urea-washed P2 membranes was determined by [³H]NTS saturation 
binding analysis. 

Stability tests in detergent solution. Cell pellets from 10 ml of insect cell cultures 
(NTSR1-GW5-T4L; cell density at harvest ~1.5–2.5 million cells per ml with ~10 
million receptors per cell) or P2 membranes (wild-type NTSR1) were 
re-suspended in 2 ml buffer containing DM or LMNG/CHS to give a final buffer 
composition of 50 mM TrisHCl pH 7.4, 200 mM NaCl, 1% DM or 1% LMNG/ 
0.1% CHS. The samples were placed on a rotating mixer at 4 °C for 1 h. Cell debris 
and non-solubilized material were removed by ultracentrifugation (TL120.2 
rotor, 60,000 r.p.m., 4 °C, 30 min in Optima Max bench-top ultracentrifuge, 
Beckman), and the supernatants containing detergent-solubilized NTSR1 were 
used to test for thermal stability in the +NTS format. For thermal denaturation 
curves, the supernatants were diluted 6.67-fold into assay buffer (50 mM TrisHCl 
pH 7.4, 200 mM NaCl) containing 10 nM [³H]NTS and incubated for 1–2.5 h on 
ice to allow [³H]NTS binding to NTSR1. Samples (120 μl aliquots) were exposed 
to different temperatures between 0 °C and 70 °C for 30 min and placed on ice. 
Separation of receptor–ligand complex from free ligand (100 μl) was achieved by 
centrifugation-assisted gel filtration (spin assay) using Bio-Spin 30 Tris columns, 
equilibrated with RDB buffer (50 mM TrisHCl pH 7.4, 1 mM EDTA, 0.1% DDM, 
0.2% CHAPS, 0.04% CHS). Control reactions on ice were recorded at the start and 
at the end of each denaturation experiment. The percentage of activity remaining 
after heat exposure was determined with respect to the unheated control. Data 
were analysed by nonlinear regression using a Boltzmann sigmoidal equation in 
Prism software (GraphPad). 
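
The analysis described above can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: the Boltzmann sigmoid is written in its common four-parameter form, the plateaux are fixed at 100% and 0% as an assumption, and a coarse grid search stands in for the nonlinear regression performed in Prism. The data points are synthetic.

```python
import math


def boltzmann(temp, top, bottom, tm, slope):
    """Boltzmann sigmoid: bottom + (top - bottom) / (1 + exp((tm - temp) / slope)).

    With a negative slope the curve falls from `top` to `bottom` as temp rises.
    """
    return bottom + (top - bottom) / (1.0 + math.exp((tm - temp) / slope))


def percent_remaining(heated_counts, control_counts):
    """Activity after heating, relative to the unheated control."""
    return 100.0 * heated_counts / control_counts


def fit_tm(temps, activities, top=100.0, bottom=0.0):
    """Coarse least-squares grid search over Tm and slope (both in °C)."""
    best = None
    for tm_step in range(0, 141):        # Tm from 0 to 70 °C in 0.5 °C steps
        tm = tm_step * 0.5
        for slope_step in range(1, 21):  # slope from -0.5 to -10 °C
            slope = -0.5 * slope_step
            sse = sum(
                (boltzmann(t, top, bottom, tm, slope) - a) ** 2
                for t, a in zip(temps, activities)
            )
            if best is None or sse < best[0]:
                best = (sse, tm, slope)
    return best[1], best[2]


# Synthetic denaturation curve with an apparent Tm of 50 °C (illustrative only).
temps = [0, 10, 20, 30, 40, 45, 50, 55, 60, 70]
activities = [boltzmann(t, 100.0, 0.0, 50.0, -3.0) for t in temps]

tm, slope = fit_tm(temps, activities)
print(tm, slope)
```

The apparent Tm is the temperature at which half of the ligand-binding activity remains. A real fit would also let the plateaux vary and report confidence intervals.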

Ligand-binding experiments. All radio-ligand binding assays were conducted 
with urea-washed P2 insect cell membranes containing wild-type NTSR1, 
NTSR1-GW5-T4L or NTSR1-GW5. 

For agonist [³H]NTS saturation binding experiments, receptors were 
incubated on ice for 1 h in 250 μl TEBB buffer (50 mM TrisHCl pH 7.4, 1 mM 
EDTA, 0.1% BSA, 0.004% bacitracin) containing [³H]NTS at a concentration 
of 0.6–20 nM. Nonspecific [³H]NTS binding was determined in the presence of 
50 μM unlabelled NTS. Separation of bound from free ligand was achieved by 
rapid filtration through GF/B glass fibre filters (Whatman) pretreated with 
polyethylenimine. The amount of radioactivity was quantified by liquid scintillation 
counting (Beckman LS6500). Data were analysed by nonlinear regression using 
the GraphPad Prism software and best fit to a one-site binding equation to 
determine the equilibrium dissociation constant (KD) for wild-type NTSR1 or 
‘apparent’ dissociation constants for NTSR1-GW5-T4L and NTSR1-GW5. 
Note that the saturation binding experiments using wild-type NTSR1 were 
conducted at equilibrium. In contrast, binding of [³H]NTS to NTSR1-GW5-T4L and 
NTSR1-GW5 did not reach equilibrium within the incubation time because of the 
very slow agonist off-rates. 
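
The one-site binding model referred to above has a simple closed form. The sketch below illustrates it over the 0.6–20 nM ligand range used in the assay. The KD and Bmax values are hypothetical placeholders rather than results from the paper, and the function names are chosen for this example.

```python
def specific_binding(total, nonspecific):
    """Specific binding is total binding minus nonspecific binding."""
    return total - nonspecific


def one_site_binding(ligand_nm, bmax, kd_nm):
    """One-site saturation binding: B = Bmax * [L] / (KD + [L])."""
    return bmax * ligand_nm / (kd_nm + ligand_nm)


# Hypothetical parameters for illustration only.
KD_NM = 1.0
BMAX = 100.0

for ligand in (0.6, 1.0, 5.0, 20.0):
    bound = one_site_binding(ligand, BMAX, KD_NM)
    print(f"{ligand:5.1f} nM -> {bound:5.1f}% of Bmax")

# By definition, half of the sites are occupied when [L] equals KD.
half_max = one_site_binding(KD_NM, BMAX, KD_NM)
```

This relationship only yields a true KD when binding has reached equilibrium, which is why the text reports 'apparent' constants for the slowly dissociating stabilized constructs.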

Competition assays with the non-peptide antagonist SR48692 (ref. 50) were 
performed in the presence of [³H]NTS (TEBB buffer, 5 nM [³H]NTS, NTSR1 
concentration <0.5 nM, incubation for 2 h on ice, 250 μl assay volume). Data 
were fit to a sigmoidal dose-response equation with standard slope using the 
concentrations of total SR48692 added versus bound [³H]NTS. 

The effect of Na⁺ ions on [³H]NTS binding was measured with NaCl 
concentrations ranging from 0 to 1 M (TEBB buffer, 8 nM [³H]NTS, NTSR1 
concentration <0.5 nM, incubation for 1.5 h on ice, 300 µl assay volume). Data 
were analysed by nonlinear regression using the GraphPad Prism four-parameter 
dose-response equation (variable slope) with the top and bottom plateaux 
constrained to 100% and 15% (wild-type NTSR1) and to 100% and 50% 
(NTSR1-GW5-T4L, NTSR1-GW5), respectively. 
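
A four-parameter dose-response curve with constrained plateaux can be sketched as below. The concentration form used here is algebraically equivalent to the more common log-concentration form and has the convenience that the 0 M NaCl point needs no logarithm. The plateaux are those stated above for wild-type NTSR1; the IC50 and Hill slope are hypothetical placeholders, not fitted values.

```python
def dose_response(conc, top, bottom, ic50, hill):
    """Four-parameter inhibition curve; hill > 0 gives a falling response."""
    return bottom + (top - bottom) / (1.0 + (conc / ic50) ** hill)

# Plateaux constrained as described for wild-type NTSR1 (100% and 15%).
top, bottom = 100.0, 15.0
# Hypothetical midpoint (M) and Hill slope, for illustration only.
ic50_m, hill = 0.1, 1.0

for nacl_m in (0.0, 0.01, 0.1, 1.0):
    bound = dose_response(nacl_m, top, bottom, ic50_m, hill)
    print(f"{nacl_m:5.2f} M NaCl -> {bound:5.1f}% of binding without NaCl")
```

With the plateaux fixed, only the midpoint and the slope remain free in the regression.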

The association of [³H]NTS was assessed at a concentration of 10 nM (TEBB 
buffer, NTSR1 concentration <0.5 nM). At the indicated time points, 250 µl 
aliquots were filtered over glass fibre filters. After 2 h, [³H]NTS dissociation 
was initiated by adding 41.7 µM unlabelled NTS or by addition of 41.7 µM 
NTS and 833 mM NaCl; this step reduced the concentration of [³H]NTS to 
8 nM. Samples were subjected to filtration after the indicated time points. No 
attempt was made to compare quantitatively the observed rates of association and 
the dissociation rate constants between NTSR1 constructs because of the very fast 
association of agonist to wild-type NTSR1, the very fast dissociation of agonist 
from wild-type NTSR1 in the presence of NaCl, and the very slow dissociation of 
[³H]NTS from NTSR1-GW5-T4L and NTSR1-GW5. 

GTPγS assays. Before G protein coupling assays, the P2 membranes were treated 
with urea to remove peripherally bound membrane proteins. GDP/[³⁵S]GTPγS 
exchange assays were performed as previously described with 1 nM receptor, 
100 nM Gαq (purified from dark-adapted Sepia officinalis retina), 500 nM 
Gβ1γ1 (purified from bovine retina), and 10 µM NTS, 40 µM nonpeptide 
antagonist SR48692 (ref. 50) or no ligand in the reaction (5 min at 30 °C). 

Large-scale purification of NTSR1-GW5-T4L from insect cell membranes. 
Cells from 3 l of insect cell culture were thawed and the volume was brought to 
approximately 240 ml with hypotonic buffer (10 mM HEPES pH 7.5, 10 mM 
MgCl₂, 20 mM KCl). The cells were then re-suspended using a Turrax T-25 
(IKA) homogenizer at 8,200 r.p.m. for 2 min. After centrifugation (45Ti rotor, 
40,000 r.p.m., 45 min, 4 °C, Optima L90K, Beckman), the membranes were 
re-suspended (Turrax T-25) in approximately 240 ml of high-salt buffer (10 mM 
HEPES pH 7.5, 1 M NaCl, 10 mM MgCl₂, 20 mM KCl) supplemented with DNase 
I (final concentration 10 µg ml⁻¹) and AEBSF (100 µM), and centrifuged again. 
All subsequent steps were performed at 4 °C or on ice, and AEBSF (100 µM final 
concentration) was repeatedly added throughout the procedure. The washed 
membranes were re-suspended in 60 ml of buffer (100 mM TrisHCl pH 7.4, 
60% glycerol) containing 20 µM NTS₈₋₁₃, and stirred for 45 min to allow agonist 
binding to membrane-inserted NTSR1-GW5-T4L. The receptor was extracted by 
drop-wise addition of 40 ml of a 3% LMNG/0.3% CHS solution. After 1 h, NaCl 
was added, and the solution was gently stirred for an additional hour. The final 
volume was 120 ml containing 50 mM TrisHCl pH 7.4, 30% glycerol, 500 mM 
NaCl, 1% LMNG/0.1% CHS and 10 µM NTS₈₋₁₃. The sample was clarified by 
centrifugation (45Ti rotor, 40,000 r.p.m., 1 h, Optima L90K, Beckman), adjusted 
with imidazole to a final concentration of 20 mM, and then passed through a 
0.2 µm filter (Stericup). Next, the sample was loaded at a flow rate of 
0.2 ml min⁻¹ onto 2.5 ml Talon resin packed into an XK16 column (GE 
Healthcare) equilibrated with Talon-A* buffer (50 mM TrisHCl pH 7.4, 30% 
glycerol, 500 mM NaCl, 20 mM imidazole, 1 µM NTS₈₋₁₃, 0.1% LMNG/0.01% CHS). 
After washing with 29 column volumes of buffer Talon-A*, NTSR1-GW5-T4L was 
eluted with Talon-B~ buffer (50 mM TrisHCl pH 7.4, 30% glycerol, 500 mM NaCl, 
200 mM imidazole, 5 µM NTS₈₋₁₃, 0.1% LMNG/0.01% CHS). Peak fractions were 
collected (5 ml) and desalted using PD10 columns equilibrated in PD10 buffer 
(20 mM TrisHCl pH 7.4, 100 mM NaCl, 0.003% LMNG/0.0003% CHS). NTS₈₋₁₃ 
was then added to a concentration of 10 µM, and the sample was used for 
crystallization. Three litres of insect cell culture yielded ~1.7 mg of purified 
NTSR1-GW5-T4L. 
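
The extraction volumes and concentrations quoted above are mutually consistent, which a short dilution check makes visible. The sketch assumes simple additive volumes (C1·V1 = C2·V2); the helper name is hypothetical.

```python
def diluted(conc, volume_ml, final_volume_ml):
    """Concentration after bringing volume_ml of a solution to final_volume_ml."""
    return conc * volume_ml / final_volume_ml

FINAL_ML = 120.0  # final extraction volume stated in the text

checks = {
    # 60 ml of membrane suspension buffer brought to 120 ml
    "TrisHCl (mM)": diluted(100.0, 60.0, FINAL_ML),
    "glycerol (%)": diluted(60.0, 60.0, FINAL_ML),
    "NTS8-13 (uM)": diluted(20.0, 60.0, FINAL_ML),
    # 40 ml of detergent solution brought to 120 ml
    "LMNG (%)": diluted(3.0, 40.0, FINAL_ML),
    "CHS (%)": diluted(0.3, 40.0, FINAL_ML),
}
for name, value in checks.items():
    print(f"{name:<14} {value:.3g}")

# Yield per litre of culture, from ~1.7 mg out of 3 l.
yield_mg_per_l = 1.7 / 3.0
print(f"yield          {yield_mg_per_l:.2f} mg per litre of culture")
```

The computed values reproduce the stated final composition (50 mM TrisHCl, 30% glycerol, 10 µM NTS₈₋₁₃, 1% LMNG, 0.1% CHS).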

Crystallization. Purified desalted NTSR1-GW5-T4L was diluted fourfold into 
buffer (25 mM TrisHCl pH 7.4, 100 mM NaCl) containing 100 µM TCEP 
and then concentrated to an estimated 20-25 mg ml⁻¹ using a 100,000 MWCO 
concentrator (Amicon Ultra, Millipore). After addition of NTS₈₋₁₃ to 1 mM 
and centrifugation (TLA 120.1 rotor, 60,000 r.p.m., 15 min, 8 °C, Beckman), 
the sample was mixed with 1.5 parts by weight of a mix of monoolein with 
cholesterol (10:1) using the two-syringe method. The resulting lipidic cubic 
phase mix was dispensed in 65-nl drops onto Laminex plates (Molecular 
Dimensions) and overlaid with 750-nl precipitant solution. Crystals grew after 
1 week in precipitant solution consisting of 20.8% PEG 400, 80 mM HEPES 
pH 7.0, 2 mM TCEP and 43 mM NaK tartrate. Crystals were collected directly 
from LCP using 50 µm micro-loops (M5-L18SP-50, MiTeGen) and immediately 
flash frozen in liquid nitrogen without adding extra cryoprotectant. 

Data collection and structure determination. Data collection was performed 
with the JBluIce-EPICS data acquisition software at the GM/CA-CAT (23-ID-B) 
beamline at the Advanced Photon Source of Argonne National Laboratory. 
Crystals within the loop were located by diffraction using the automated rastering 
module of JBluIce-EPICS. Most crystals showed diffraction spots to ~4-5 Å; a 
few crystals diffracted to ~3-3.5 Å with a single crystal showing diffraction to 
~2.6 Å. A single data set was collected from this crystal containing 150 frames 
with an oscillation of 1 degree per frame and was processed in space group P2₁ to 
2.80 Å with an overall completeness of 93.1%. 

Structure determination was performed by molecular replacement using the 
Phaser-MR module of PHENIX. Two search models were created using the 
structure of β₂AR (PDB code 3NY9) with one containing the T4 lysozyme 
domain and one containing the receptor seven-helix bundle without side chains. 
One copy of each search model was found, producing a single solution. 
Subsequent refinement was performed by PHENIX and model building with 
COOT. Strong difference density was observed within the ligand-binding cavity 
which we were able to model unambiguously as the six-residue neurotensin 
peptide RRPYIL (NTS₈₋₁₃). The structure was refined with final R/Rfree values 
of 0.23/0.28. A summary of data collection and refinement statistics is reported in 
Supplementary Table 3. 

We noted during the later stages of structure determination that no electron 
density was observed for the C-terminal carboxyl group of NTS₈₋₁₃ (using an 
Fo − Fc map contoured at ~0.1 e⁻ Å⁻³, ~2σ) despite clear electron density for 
the rest of the agonist peptide. However, relatively strong density was found in 
close proximity to the modelled C-terminal carboxyl group of L13 (Supplementary 
Fig. 12). We interpreted the lack of density for the modelled C-terminal carboxyl 
group of NTS₈₋₁₃ and the presence of electron density close by as a probable result 
of decarboxylation caused by radiation damage (estimated absorbed dose over 
150 frames ~314 MGy, 10× experimental dose limit, RADDOSE) with the 
cleaved entity being stabilized by a potential salt-bridge-like interaction with 
R327. However, for the purpose of understanding the biology and the interactions 
involved in agonist binding by NTSR1 in vivo, that is, in the absence of radiation 
damage, and to minimize any confusion for those scientists who may not be 
familiar with X-ray crystallography yet who may view the NTSR1-GW5-T4L 
structure, we have chosen to model NTS₈₋₁₃ with a carboxyl group at its C terminus 
in the final NTSR1-GW5-T4L model. 

Figures were prepared in PyMOL (Schrödinger). Structural alignments were 
done with the ‘align’ command of PyMOL. 

Unresolved anisotropies of the cosmic near-infrared background 
radiation are expected to have contributions from the earliest galaxies 
during the epoch of reionization and from faint, dwarf 
galaxies at intermediate redshifts. Previous measurements 
were unable to pinpoint conclusively the dominant origin because 
they did not sample spatial scales that were sufficiently large to 
distinguish between these two possibilities. Here we report a 
measurement of the anisotropy power spectrum from subarcminute to 
one-degree angular scales, and find the clustering amplitude to 
be larger than predicted by the models based on the two existing 
explanations. As the shot-noise level of the power spectrum is 
consistent with that expected from faint galaxies, a new source 
population on the sky is not necessary to explain the observations. 
However, a physical mechanism that increases the clustering 
amplitude is needed. Motivated by recent results related to the 
extended stellar light profile in dark-matter haloes, we consider 
the possibility that the fluctuations originate from intrahalo stars 
of all galaxies. We find that the measured power spectrum can be 
explained by an intrahalo light fraction of 0.07 to 0.2 per cent relative 
to the total luminosity in dark-matter haloes of 10⁹ to 10¹² solar 
masses at redshifts of about 1 to 4. 

To distinguish between the two interpretations of the near-infrared 
anisotropy power spectrum, we have analysed imaging data from the 
Spitzer Deep, Wide-Field Survey (SDWFS). This survey covers 10.5 
square degrees on the sky with the IRAC instrument in its four bands 
between 3.6 and 8 µm. We focus on the data at 3.6 and 4.5 µm, as the 
confusion from zodiacal light limits extragalactic background studies 
at 5.8 and 8 µm (ref. 8). The data were taken in four separate epochs 
between 2004 and 2008, and observations were conducted in ways that 
minimize the systematic errors associated with anisotropy 
measurements. In particular, the four epochs were observed at different 
roll angles of the instrument so that the measurements are robust 
against detector artefacts, persistence resulting from saturated bright 
stars, and variations in the bias level. The SDWFS mapping strategy was 
also optimized to facilitate self-calibration of the data by maximizing 
inter-pixel correlations. 

To limit the influence of bright stars and galaxies, including extended 
sources, in our anisotropy measurements, we mask all sources that 
are detected either in the combined SDWFS data or in the ancillary 
multi-band optical and near-infrared data. The effects of assembling 
(‘mosaicing’) individual detector frames, pixelization of the maps, and 
the detected-source mask are captured by the map-making transfer 
function (Supplementary Information). We compute the transfer 
function and its uncertainty with a large set of sky simulations. The 
point-spread function (PSF) and its uncertainty were determined by 
measuring and modelling the PSF of stars in different subregions of the 
image and computing the variance of the differences between the 
modelled PSFs. 


The power spectrum measurements at 3.6 and 4.5 µm show a clear 
excess above the shot-noise level (Fig. 1). The shot noise dominates the 
anisotropy power spectrum at subarcminute angular scales 
corresponding to angular multipole moment ℓ > 10⁵. Such a shot noise is 
expected from the small-scale Poisson behaviour of the spatial 
distribution of sources on the sky. The clustering amplitude we measure at 
ℓ ~ 10⁴ is fully consistent with existing measurements of the anisotropy 
power spectrum with IRAC in deeper, but smaller-area, fields. At 
angular scales of tens of arcseconds, corresponding to ℓ > 5 × 10⁴, our 
shot-noise level is higher than that of a recent measurement by a factor 
of about 2 because deeper data allow more of the fainter sources to be 
individually detected and masked. Nevertheless, we independently 
confirm the near-infrared background anisotropies at angular scales 
larger than a few arcminutes at the previously reported amplitude. 
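
To relate the multipoles in this paragraph to angles on the sky, the conversion θ = 2π/ℓ (the convention used for the upper axis of Fig. 1) can be evaluated directly. The helper name below is hypothetical, and the multipole values are the round numbers discussed in the text.

```python
import math

def multipole_to_arcmin(ell):
    """Angular scale 2*pi/ell, converted from radians to arcminutes."""
    return math.degrees(2.0 * math.pi / ell) * 60.0

# Round multipole values discussed in the text.
for ell in (360, 3_000, 10_000, 50_000, 100_000):
    arcmin = multipole_to_arcmin(ell)
    print(f"ell = {ell:>7,}: {arcmin:8.3f} arcmin = {arcmin * 60.0:9.1f} arcsec")
```

Under this convention one degree corresponds to ℓ = 360, and multipoles above about 2 × 10⁴ are subarcminute.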

The near-infrared anisotropies have been previously interpreted as 
either due to spatial clustering of primordial galaxies responsible for 
cosmic reionization or due to faint, dwarf galaxies at low redshifts that 
fall below the individual source detection threshold of Spitzer images. 
Figure 1 shows that the measured fluctuations in SDWFS are well 
above both these interpretations. The power spectrum predictions 
for galaxies at redshifts z > 6 rely on a combination of analytical 
calculations and numerical simulations of reionization. If we 
force the z > 6 galaxy model to fit the power spectrum data, then 
the integrated intensity of z > 6 galaxies is about 2 nW m⁻² sr⁻¹ at 
3.6 µm (ref. 19). To reach such a high intensity, these galaxies must be 
very efficient in converting baryons to stars. In fact, the required 
star-formation rate conflicts with the measured metal abundance at z > 4, 
the measured X-ray background when compared to X-rays from stellar 
end products such as black holes, and the measured luminosity 
functions of bright Lyman-dropout galaxies. Unless a significant revision 
of our current understanding of z > 6 galaxy statistics is made, it is 
unlikely that the measured anisotropy power spectrum is dominated 
by the primordial galaxies. 

A large compilation of multi-wavelength luminosity functions 
and galaxy number counts allows predictions to be made for the 
intensity fluctuations due to low-redshift, faint galaxies. The measured 
luminosity-function slope at the faint end is used to extrapolate to the 
fainter galaxies that are undetected in the Spitzer images. An increase 
in the faint-end slope above the measured values does not increase the 
clustering amplitude on the angular scales of interest without modifying 
the shot-noise level. Although the clustering amplitude is smaller than 
the measurements, the prediction related to faint, dwarf galaxies shows 
that they generate a shot-noise level consistent with the measured 
small-scale anisotropy power spectrum (Fig. 1). At angular scales of a 
few tens of arcminutes, the measured clustering 
amplitude is a factor of about 6 to 10 above the prediction. Although this 
difference shows that a new model is needed to explain the anisotropy power 
spectrum, the consistency with the shot-noise level means that 
we do not need to invoke a new population of point sources on 
the sky to explain the observations. 

While keeping the shot-noise level the same, the measurements can 
be explained by any physical effect that boosts the two-halo term of the 
halo model of galaxy clustering. One possibility is to increase the halo 
masses of the faint, dwarf galaxies so that their clustering amplitude is 
increased with a corresponding increase in their large-scale bias factor. 

Figure 1 | The angular power spectrum of the unresolved near-infrared 
background. Shown is the total power spectrum ℓ²C_ℓ/2π of SDWFS at 3.6 µm 
(a) and 4.5 µm (b) as a function of the multipole moment ℓ. The corresponding 
angular scale 2π/ℓ is listed on the upper x axis in units of arcminutes. SDWFS 
imaging data were taken on the same field at four separate epochs in January 
2004, August 2007, February 2008 and March 2008. Each epoch of data, taken 
over 7 to 10 days, includes 4,300-4,900 IRAC frames that were combined to 
make mosaics using the self-calibration algorithm. The total integration time 
is 6 min per pixel. These individual frames were first visually inspected and 
cleaned of artefacts such as asteroidal trails and hot pixels. Through 
cross-correlations between sum and difference maps between epochs, we make 
independent measurements of the sky signal and noise. The final power 
spectrum (filled circles) is the average of the multi-epoch cross-correlation data 
under the assumption that the instrumental noise is not correlated between 
epochs. The two shaded regions show the expected contribution from z > 6 
galaxies (blue) and low-redshift galaxies (yellow) based on two model 
predictions in the literature. The lines show a diffuse IHL model where we 
show the signal in terms of the total (solid), one-halo (dashed-dotted) and 
two-halo (dotted) terms. The dashed line is the best-fit shot-noise signal that 
dominates the anisotropies at small angular scales. In a and b, the error bars are 
1σ uncertainties in the power spectrum. They are determined by propagating 
the errors from the beam measurement into the power spectrum, while the 
simulations, based on noise measurements, were used to obtain 
instrumental and sky variance. The quadratic sum of these errors and the 
map-making transfer function uncertainty constitutes the final error estimate. 

The modification needed to explain the fluctuation data is, 
however, ruled out by the measured number counts and the redshift 
distribution. Because intensity anisotropies are measured, another 
option is to introduce a luminosity component to the dark-matter 
haloes that remains unmasked when the hosted bright galactic disks 
are masked as part of the analysis. Such a possibility exists in the 
literature in the form of diffuse halo stars in the extended stellar profile 
of galaxies out to distances of 100 kpc (ref. 23). In our anisotropy 
measurements, we mask the faintest detected galaxies to 3-4 arcsec, 
which removes the light from the bulges and disks of those galaxies. To 
remove the diffuse light component, we would have to mask to a radius 
greater than 10 arcsec around each galaxy. The surface density of 
galaxies down to AB magnitude m_AB < 22 at 3.6 µm is such that we 
expect 2 to 3 galaxies within a circle of radius 10 arcsec. Thus masks 
that successfully remove the diffuse component leave no pixels on 
the map from which to measure the anisotropy power spectrum. 
Existing studies discuss this extended emission in terms of the diffuse 
intrahalo light (IHL), explained as originating in tidally stripped stars 
produced during galaxy mergers and collisions. The fraction of stripped 
stars is expected to be a function of the halo mass, with more massive 
haloes containing a larger fraction of the diffuse halo emission. 
On galaxy cluster scales, the diffuse intracluster light is a significant 

Figure 2 | The IHL fraction from diffuse stars as a function of the halo mass. 
The dark and light blue shaded regions show the 95% and 68% range of 
intrahalo light fraction f_IHL relative to the total luminosity of the dark-matter 
haloes as a function of the halo mass M from an analytical prediction, valid for 
f_IHL > 4 × 10 * and M > 5 × 10¹¹ M⊙ and at z = 0. We show the case where 
dark matter subhaloes on orbits passing within a critical radius of the host halo 
centre contribute their light to the central galaxy rather than to the diffuse 
component. We also show a prediction where f_IHL is constant, due to dwarf 
galaxies that are completely destroyed, with a value of ~0.005 when 
M < 5 × 10¹¹ M⊙ (solid line fixed at f_IHL = 5 × 10⁻³). The downward arrow 
indicates the possibility that the constant f_IHL value for low-mass haloes may be 
smaller at higher redshifts. The red and orange hatched regions at the bottom of 
the plot are the preferred 68% and 95% confidence level range, respectively, on 
f_IHL from our analysis of the SDWFS near-infrared anisotropy power spectrum. 
The mass range is determined by the minimum and maximum halo masses 
consistent with the halo model fit that includes the IHL component. Both the 
mass and f_IHL ranges are valid over the broad redshift interval from z = 1 to 4 
over which the anisotropy signal is generated. We do not find a significant halo 
mass dependence on the IHL fraction, with the mass-dependent power-law 
having a value of 0.09 ± 0.01 between 10⁹ and 10¹² M⊙ (see Supplementary 
Information section 9), consistent with the possibility that f_IHL is mass 
independent when M < 5 × 10¹¹ M⊙. Our model requires the total 
luminosity/halo-mass relation to evolve with redshift as (1 + z)'2*l. This 
luminosity evolution with redshift can also be absorbed into the evolution of 
f_IHL(M) with redshift. For reference, we also show measurements (open 
diamonds) and 1σ errors of the intracluster light, the galaxy group and cluster 
analogue for IHL when M > 5 × 10° M⊙. At halo masses around 10¹² M⊙ we 
show (red arrows) the 95% confidence level upper limit on f_IHL estimated for 
Milky Way and Andromeda (M31). 

Figure 3 | The spectral energy distribution of infrared background 
anisotropies. Data symbols show the frequency spectrum of near-infrared and 
optical background anisotropies as a function of the wavelength. We show the 
r.m.s. fluctuation amplitude at ℓ = 3,000, corresponding to fluctuations at 
6-arcminute angular scales; our measurements (black filled circle) and the existing 
measurements in the literature (purple filled squares, ref. 11; red crosses, ref. 12) 
are presented at the same angular scales. Error bars are the 1σ uncertainties directly 
propagated to an r.m.s. fluctuation amplitude from the power spectrum errors. We 
do not show the measurements at 1.6 and 1.1 µm (ref. 10) as they do not probe 
fluctuations on scales greater than 2 arcmin on the sky due to the narrow field of 
view of the observations. The shaded regions are predictions for the IHL taking a 
variety of spectral energy distributions for the stripped stars (green), z > 6 galaxies 
(yellow) and low-redshift galaxies below the detection level of masking (blue). 


fraction of the total luminosity of the cluster. We describe the intensity 
anisotropy power spectrum from the IHL by modifying the standard 
galaxy clustering predictions to include a profile for the diffuse stars in 
haloes (see Supplementary Information section 8). If the clustering 
excess in the near-infrared anisotropy power spectrum is due to IHL, then 
we find that the measured anisotropies can be described with haloes in the 
mass range of 10⁹ to 10¹² solar masses (M⊙). Averaged over this mass 
range, we find an IHL fraction relative to the total luminosity, f_IHL, of 
0.07-0.2% at a 68% confidence level (Fig. 2). The implied fraction is consistent 
with the theoretical expectation that the IHL level is small for low-mass 
haloes, but differences also exist with current theory predictions, 
especially in terms of the power-law slope of the halo mass dependence. 

If this new interpretation involving IHL is the correct description 
of measured infrared background anisotropies, we find that the 
IHL in all dark-matter haloes that we are probing contributes 
0.75 ± 0.25 nW m⁻² sr⁻¹ to the total intensity at 3.6 µm. This intensity 
can be compared to the r.m.s. fluctuations of about 0.1 nW m⁻² sr⁻¹ at an 
angular scale of a few arcminutes (see Fig. 3). The IHL fluctuation signal 
varies spatially at a level of 10-15% of its integrated intensity and below 
1% of the total background intensity of 13.3 ± 2.8 nW m⁻² sr⁻¹ at 
3.6 µm (ref. 27). As the spectral energy distribution of IHL is mostly 
unknown, we make use of a variety of spectral energy distributions 
(SEDs) from B- to K-type stellar spectral templates and find an order 
of magnitude variation at wavelengths of ~1 µm (Fig. 3). In all these 
cases, we predict the existence of optical background light fluctuations. 
They will have a similar power spectrum shape and will be fully correlated 
with fluctuations at 3.6 µm. Furthermore, the near-infrared anisotropies 
we have measured should be correlated with the submillimetre 
anisotropies, especially if there is diffuse and extended dust associated with 
galaxies. These form future tests that can be used to improve our 
understanding of the content and nature of IHL in distant dark-matter haloes. 
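
The percentages quoted in this paragraph follow from simple ratios of the central values, as the short check below illustrates. It uses only the central intensities stated in the text and ignores their uncertainties; the variable names are hypothetical.

```python
# Central values at 3.6 um quoted in the text, in nW m^-2 sr^-1.
ihl_intensity = 0.75      # integrated IHL intensity
rms_fluctuation = 0.1     # r.m.s. fluctuation at a few arcminutes
total_background = 13.3   # total background intensity (ref. 27)

fluct_over_ihl = rms_fluctuation / ihl_intensity
fluct_over_total = rms_fluctuation / total_background
ihl_over_total = ihl_intensity / total_background

print(f"fluctuation / IHL intensity:      {fluct_over_ihl:.1%}")
print(f"fluctuation / total background:   {fluct_over_total:.2%}")
print(f"IHL intensity / total background: {ihl_over_total:.1%}")
```

The first ratio falls within the quoted 10-15% range and the second is below 1%, as stated.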

Quasi-periodic events in crystal plasticity and the 
self-organized avalanche oscillator 

Stefanos Papanikolaou, Dennis M. Dimiduk, Woosong Choi, James P. Sethna, 
Michael D. Uchic, Christopher F. Woodward & Stefano Zapperi 


When external stresses in a system (physical, social or virtual) are 
relieved through impulsive events, it is natural to focus on the 
attributes of these avalanches. However, during the quiescent 
periods between them, stresses may be relieved through competing 
processes, such as slowly flowing water between earthquakes 
or thermally activated dislocation flow between plastic bursts in 
crystals. Such smooth responses can in turn have marked effects 
on the avalanche properties. Here we report an experimental 
investigation of slowly compressed nickel microcrystals, covering 
three orders of magnitude in nominal strain rate, in which we 
observe unconventional quasi-periodic avalanche bursts and higher 
critical exponents as the strain rate is decreased. Our experiments 
are faithfully reproduced by analytic and computational dislocation 
avalanche modelling that we have extended to incorporate 
dislocation relaxation, revealing the emergence of the self-organized 
avalanche oscillator: a novel critical state exhibiting oscillatory 
approaches towards a depinning critical point. This theory 
suggests that whenever avalanches compete with slow relaxation, in 
settings ranging from crystal microplasticity to earthquakes, 
dynamical quasi-periodic scale invariance ought to emerge. 

Physical systems under slowly increasing stress may respond through 
abrupt events. Such jumps in observable quantities are abundant, being 
found in systems ranging from complex social networks to earthquakes. 
Even though these avalanches appear randomly sized and randomly 
placed, the statistical properties of avalanches are universal, falling into 
well understood non-equilibrium universality classes. The main uni- 
fying concept is the depinning of an interface under an external field. 
An implicit assumption underlying this concept is that all other coexis- 
ting physical processes are either too fast, and thus average out, or too 
slow, rendering a static approximation valid. However, the latter as- 
sumption is not always true if the slow processes rearrange the pinning 
landscape at rates comparable to the external field driving rates. For as 
the fast avalanches are scale-invariant, the whole time series, including 
the waiting intervals between the fast events, is also scale-invariant. It is 
within the waiting intervals that a slow restructuring of the pinning field 
can thrive and alter universal predictions. 

Although intermittent plastic flow is well known, only recently was 
it shown to be statistically akin to universal mean-field avalanche 
behaviour in the quasi-static limit. Investigations of the phenomenon 
have used a wide variety of techniques, including acoustic emission 
from deforming ice, high-resolution extensometry of tensile strained 
copper, and microcrystal compression tests for face-centred cubic and 
body-centred cubic crystals. However, most of these single-crystal 
studies covered only a narrow range of high nominal strain rates. 
Preliminary evidence that suggests a more complex physical picture was 
discussed in ref. 16, where a rate dependence of the cumulative strain 
event size distributions was observed. Interesting rate effects have also 
been observed in materials with solute atoms, typically polycrystalline, 
that display the Portevin-Le Chatelier (PLC) effect. The PLC 
avalanche distribution exponents show no evidence of strain-rate 
dependence (although strain dependence is shown), whereas PLC at lower 
rates turns into similar-size localized slip excitations and chaotic 
behaviour, distinctly different from the physical behaviour observed in ref. 
16. Instead, the PLC avalanche behaviour is more consistent with the 
phenomenology of theories of avalanches with weakening effects. In 
the experiments we report here, single nickel microcrystals of 
comparatively large dimensions, having diameters between 18 and 30 µm, were 
uniaxially compressed. By controlling the applied external stress to 
maintain a nominal strain rate and by detecting slip with extremely 
sensitive extensometry, we track crystal displacements in time. In order 
to study the rate dependence, we perform our experiments at three 
different nominal strain rates (10⁻⁴ s⁻¹, 10⁻⁵ s⁻¹, 10⁻⁶ s⁻¹). For each 
sample, the time series of the displacement time derivative is filtered 
using optimal Wiener filtering methods adapted for avalanche time 
series, and avalanche events are appropriately defined without using 
thresholding. 
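
A nominal strain rate can be related to the speed of the top surface of a microcrystal through the specimen height. The sketch below assumes the simple relation velocity = strain rate × height and uses the surface velocities quoted in the caption of Fig. 2 (4, 0.4 and 0.04 nm s⁻¹) together with nominal rates of 10⁻⁴, 10⁻⁵ and 10⁻⁶ s⁻¹; the implied height is an inference from these numbers, not a value stated in the text, and the function name is hypothetical.

```python
def implied_height_um(velocity_nm_s, strain_rate_per_s):
    """Specimen height (um) implied by surface velocity / nominal strain rate."""
    return velocity_nm_s / strain_rate_per_s * 1e-3  # convert nm to um

strain_rates = (1e-4, 1e-5, 1e-6)   # nominal strain rates, s^-1
velocities = (4.0, 0.4, 0.04)       # top-surface velocities, nm s^-1

heights = [implied_height_um(v, r) for v, r in zip(velocities, strain_rates)]
for rate, velocity, height in zip(strain_rates, velocities, heights):
    print(f"rate {rate:.0e} s^-1, velocity {velocity:5.2f} nm/s "
          f"-> height ~{height:.0f} um")
```

All three pairs imply the same height, which is consistent with the three rates being applied to specimens of similar size.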

As deformation proceeds in the microcrystals, the dislocation 
ensemble evolves at different timescales. The most apparent activity 
is associated with fast glide processes, which produce stochastic plastic 
bursts. Concurrently and between these events, other less observable 
processes (Fig. 1) contribute to collective slower relaxations. Like glide, 
these too are thermally activated processes accessible at these high 
levels of stress, but have different activation barriers: for example, 
the viscoelastic response of the dislocation forests after fast avalanche 
strain bursts, the localized dislocation climb motion in directions other 
than the glide plane under high local stresses, and also the cross-slip 
processes of dislocations shifting between glide planes. They all com- 
pete to minimize the far-field stress while changing the local stress 
landscape and bypassing the fast glide process. They affect dislocation 
slip, but at a slower rate than avalanche glide. In our experiments, we 
classify as ‘slow relaxation’ all the deformation that does not belong to 
avalanches of the scaling regime. Using this definition, the slow relax- 
ation fraction increases drastically at the two slowest strain rates. Thus, 
rate dependence of the avalanche size distribution (Fig. 2c) occurs 
when the nominal strain-rate becomes comparable to the rate of the 
slow relaxation processes (Fig. 1). Although the exact mechanisms are 
unknown, one localized reorganization mechanism possible at these 
large local stresses and low temperatures (0.17 times the melting 
temperature, ~300 K) could be tied to newly discovered unconventionally 
large cross-slip rates calculated for similar conditions to our 
experiments. Regardless of the possible types of relaxation mechanisms, 
we focus on the experimental fact that relaxation and driving 
rates become comparable. We model phenomenologically the slow 
relaxation in an intuitive manner, and then show a posteriori that 
our results are independent of the particular form of relaxation 
dynamics (see Supplementary Information). 

The slip event sizes S, labelled by their beginning time, display a 
striking dependence on the driving rate. After we smooth the time 

Figure 1 | Dislocation motion and several slow relaxation processes during 
the waiting intervals between avalanches. a, Diagram of typical unit 
dislocation motions. Lighter to darker indicates time evolution. Under stress, a 
dislocation loop nucleates and grows until it gets pinned on its slip plane, which 
is a common and fast glide-slip burst unit process. Then, a screw dislocation 
segment undergoes double cross-slip to a parallel slip plane, bypassing glide 
barriers. Finally, the dislocation glides and ultimately, may climb. These unit 
processes underlie the dislocation ensemble dynamics (not shown). b, Strain 
jump rate time series of a nickel sample at a strain rate of 10 °s |. The 
avalanche phenomenon not only involves fast and violent scale-invariant 


bursts”, but also long waiting times’ between glide events. During those times, 
slow relaxation events happen that are typically hard to experimentally 
distinguish due to the external noise levels (inset). c, Black data points show the 
estimated strain percentage accumulated in slow relaxation events (generally 
called ‘creep’), with the threshold set by the event size distribution (Fig. 2): this 
percentage (relaxation strain/total final strain) strongly increases as the rate 
decreases. Experimental noise contributes to the relaxation strain measured. 
Red data points show that a non-trivial quasi-period (see Fig. 2) of avalanche 
behaviour emerges and increases dramatically as the nominal rate decreases. 
Error bars indicate the size of systematic variability due to thresholding. 
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Figure 2 | Comparison between microplasticity experiments and theoretical 
modelling. a, Average avalanche size in 400-s windows versus time for 
different strain rates (decreasing top to bottom: 10-4103, 10> %s"4, 
corresponding to respective velocities of the microcrystal top surfaces of 4, 0.4, 
0.04nms '). Time axes are rescaled by the nominal strain rate, aligning the 
‘strain scales’. Quasi-periodic avalanche behaviour emerges as the nominal 
strain rate decreases. The period is similar in ‘strain scale’-—a key prediction of 
our theory. b, Stick-slip oscillations observed experimentally in a show typical 
characteristics of the model of equation (1) (using 400d-long averaging 
windows). The relaxation rate is fixed (R = 2) and the strain-rate is varied (by 
modifying c: top to bottom; 1077/5, 10” */6, 10” */6), following the 
experiments. The unit of strain is 2X 10° and the fast timescale 5 = 0.5 s. We 
show the actual avalanche events without the distortion that appears due to the
strain coming from slow relaxation; this difference gives the overall scale
mismatch of a and b. c, The probability distribution P(S) is shown. There is a
marked increase in the critical exponent for the size of the displacement jumps
as the strain rate decreases. The highest ($10^{-4}$) and lowest ($10^{-6}$) strain rates
are fitted to power laws $S^{-1.5} f(S/S_0)$ and $S^{-2.0} f(S/S_0)$, respectively. The key
shows strain rate and corresponding velocity for data points. d, In the model of
equation (1), the variation of the rate c shows similar behaviour to that observed
in c, with exponent drift from ~1.5 (ref. 8) to ~2.1, with fitting error ~0.2,
consistent with the discussion in the text and with fitting cut-off functional
forms $f(S/S_0)$ that are discussed in Methods.




series over a fixed window of 400 s and then rescale the time axis to
display comparable strain evolution, very clear oscillatory-like
behaviour emerges at the $10^{-6}$ s$^{-1}$ rate (Fig. 2a). The emergent period
displays a strong dependence on the strain rate, while its magnitude reaches
~8 h (for $10^{-6}$ s$^{-1}$), consistently much larger than the length we chose
for the fixed window averaging (Fig. 1c). The novel behaviour is also
reflected within statistical distributions of S: these show power-law
behaviour ($P(S) \propto S^{-\tau}$) for all studied strain rates ($10^{-4}$ s$^{-1}$,
$10^{-5}$ s$^{-1}$, $10^{-6}$ s$^{-1}$), while the value of the power-law exponent $\tau$ drifts
from ~1.5 (consistent with refs 7 and 25) to a higher, unexpected value
of ~2.0 for the slowest strain rate (Fig. 2c). Analogous behaviour is
observed for the avalanche durations T and their correlation with the
sizes.
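The smoothing and rescaling step described above can be sketched in a few lines. This is only an illustration, not the authors' analysis code: the function name `windowed_mean_sizes`, the use of non-overlapping windows (the text does not say whether the 400-s window is running or binned) and the choice of the window centre as the reference time are all assumptions made for the example.

```python
def windowed_mean_sizes(times, sizes, window, strain_rate):
    """Average avalanche size in consecutive, non-overlapping time windows.

    times       -- event start times in seconds (assumed >= 0)
    sizes       -- event sizes S, one per entry of `times`
    window      -- window length in seconds (400 s in the text)
    strain_rate -- nominal strain rate in 1/s, used to rescale the time axis

    Returns a list of (nominal strain at window centre, mean size) pairs,
    skipping windows that contain no events.
    """
    if not times:
        return []
    n_windows = int(max(times) // window) + 1
    sums = [0.0] * n_windows
    counts = [0] * n_windows
    for t, s in zip(times, sizes):
        i = int(t // window)          # index of the window holding this event
        sums[i] += s
        counts[i] += 1
    result = []
    for i in range(n_windows):
        if counts[i]:
            centre = (i + 0.5) * window
            # Multiplying time by the nominal strain rate aligns the
            # "strain scales" of runs performed at different rates.
            result.append((centre * strain_rate, sums[i] / counts[i]))
    return result
```

Because the horizontal axis is the nominal strain rather than the time, curves taken at different strain rates can be overlaid directly, which is how a period that is similar in 'strain scale' would show up.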

Our explanation of the experimental data builds on the model
framework of dislocations moving through a disordered landscape
of forest dislocations, on a single slip plane under shear stress. This
is a successful picture for avalanches during stage I plasticity that
strongly relies on well-understood models of (2+1)-dimensional
interface depinning. We construct a minimal generalization via an added
relaxation term, (I):


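$$\frac{\partial \phi(\mathbf{r})}{\partial t} = \underbrace{D\left(\frac{\sigma}{\mu}\right)^{n}\Theta(\sigma)}_{\text{(I)}} + \underbrace{\frac{1}{\epsilon}\,\Theta(\sigma-\sigma_f)}_{\text{(II)}} \qquad (1)$$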


Here $\mathbf{r}$ denotes the location on the slip plane, $t$ is time, $\Theta$ is the
Heaviside step function, $\mu$ is the shear modulus of the system, $\epsilon \ll 1$
is a dimensionless constant that controls the timescale separation of
the two processes (I) and (II), $n$ is the stress exponent dependent on the
slow relaxation mechanism, $\sigma$ is the local applied stress and $\sigma_f$ denotes
random stress barriers to glide slip. The basic slip variable of the system,
$\phi$, is the y-x component (Burgers vector along x) of the plastic distortion
tensor when only infinite dislocations along z on x-z slip planes
are considered. Part (I) of equation (1) denotes the coarse-grained
relaxation of edge dislocations, with rate D at fixed temperature. Only
positive slip motion is considered to simplify our simulations, and we
have shown that our conclusions are qualitatively independent of such
assumptions (see Supplementary Information). With D we define an
effective rate of thermally activated processes that lead to slow
relaxation. We set the exponent n = 1 but our conclusions do not
qualitatively depend upon it. The applied stress is the x-y component of the
stress tensor:


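$$\sigma(\mathbf{r}) = \sigma_{\mathrm{ext}} + \sigma_{\mathrm{int}}(\mathbf{r}) + \sigma_{\mathrm{hard}}(\mathbf{r}) = Mct + \int d^{2}r'\,K(\mathbf{r}-\mathbf{r}')\,\phi(\mathbf{r}') - k\,\phi(\mathbf{r}) \qquad (2)$$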


where $\sigma_{\mathrm{ext}}$ is the external stress with c the stress increase rate, $\sigma_{\mathrm{int}}$ is the
dislocation local stress, $\sigma_{\mathrm{hard}}$ is the local stress due to dislocation
hardening with k a phenomenological hardening parameter, and $\mathbf{r}'$ is
defined on the slip plane.

We consider a stress-controlled test in a stationary plastic regime
($\sigma_{\mathrm{ext}} = Mct$), where M is a machine stiffness, and c has strain-rate
units. The relative timescales of the relaxation and stress rate are
controlled by the dimensionless parameter R = D/c. Term (II) of equation
(1) describes the fast glide process which drives the avalanche
dynamics. Hardening is phenomenologically represented via a
coefficient k that controls the distance from the depinning critical point.
For clarity, we separate the relevant timescales by considering $\epsilon \ll 1$,
leading to infinitely fast avalanches compared to the slow relaxations.
Finally, $\sigma_{\mathrm{int}}$ contains the appropriate interaction kernel K for single
slip straight edge dislocations and $\sigma_f$ denotes the uncorrelated local
pinning potential due to dislocation forests. We find that our main
qualitative conclusions are independent of the kernel, and thus are
equally applicable to other models of avalanches in plasticity. If we
were considering the model of ref. 11 where mixed dislocations are
included, we would modify our definitions by using a single x-y slip
plane, assume $\phi$ to be the x-y tensor component, and only apply the
z-x component of the stress.

The model of equations (1) and (2) is solved by explicit integration
on a two-dimensional grid: for no relaxation (D = 0), the avalanches
display statistics consistent with the predictions of the mean-field
theory of interface depinning. As D increases, both the critical
exponent $\tau$ for strain jump sizes S ($P(S) \propto S^{-\tau}$) and the critical exponent $\alpha$
for event durations T ($P(T) \propto T^{-\alpha}$) increase substantially. In the
context of mean-field theory, somewhat similar behaviour takes place
when the driving rate c is increased, leading to avalanche overlap
and exponents decreasing below the mean-field values; we study the
case where $c \to 0$, keeping R fixed (and >1), where the exponents
increase above their mean-field values.

The increase of the exponents is accompanied by a quasi-periodic
behaviour, with intermittent but regularly spaced large slip events
(Fig. 2b); note that the term quasi-periodic is used here loosely and is
unrelated to the formal definition of quasiperiodic functions. If one
considers the average avalanche size in a window, similar to the experimental
study but without strain from relaxation included, it is clear that the D = 0
flat-in-time profile is replaced by strongly oscillating profiles in the
presence of slow relaxation (D > 0). The average avalanche size (D = 0)
is inversely related to the hardening coefficient k, $k \propto \langle S\rangle^{-1}$. Thus,
there is a distribution of hardening coefficients being effectively
sampled, reflecting local heterogeneity. We assume that such a
distribution $g(k')$ biases the integration, over all possible hardenings $k'$, of the
size probability distribution of the D = 0 model, leading to the observed
dynamically integrated size distribution. That is, a curve in Fig. 2d may
be obtained as:


$$P_{\mathrm{int}}(S) = \int_{0}^{\infty} g(k')\,P(S,k')\,dk' \qquad (3)$$


For the case of interest we have $P_{\mathrm{int}}(S) = S^{-\tau}\mathcal{P}(S/S_0)$, where $S_0$ is the
cutoff of the size distribution, yielding a higher effective size exponent
$\tau = 2$ for slow strain rates. It is worth noting that in this picture, the
largest events have a non-trivial scaling behaviour (see Supplementary
Information).

The profound effects of slow rate processes within our dislocation
model and the comparison with experiments force us to ask if our
findings are general. To make analytical progress towards an answer,
we consider the slip susceptibility $\rho$, the multiplier giving the net
number of local slips triggered by a single slip. Here, $\langle\rho\rangle$ is controlled by
the hardening coefficient, $\langle\rho\rangle \propto 1-k$. In traditional mean-field
interface depinning models, this is the 'distance' of the system from
the critical depinning point and is saturated to a fixed point value after
short-time transients. When $\rho \ll 1$, the system is far from critical,
whereas the system is near critical when $\rho \sim 1$. Numerical solutions
to equation (1) verify that the additional relaxation process affects $\rho$ in
an unusual way. When an avalanche with size $S_t$ takes place, $\rho$
instantaneously decreases proportionally to $S_t$, whereas it increases linearly
between avalanches. We suggest that the basic physical mechanism
behind the behaviour of equation (1) (with $c \to 0$ but R fixed) is given
by the behaviour of the slip susceptibility $\rho$, whose basic characteristics
can be described by a Markov process:


$$\rho_{t+1} - \rho_t = \Delta\rho_t = c_d\left(1 - \frac{S_t}{\bar{S}}\right) \qquad (4)$$


where $S_t$ follows the mean-field distribution $P(S_t) = N S_t^{-3/2}\exp(-S_t/S_0)$; here N is a
normalization factor, $S_0 = a/(1-\rho_t)^2$ (ref. 1) and the step $c_d$ can be
thought of as being proportional to R. The traditional avalanche mean-field
behaviour is described by the $c_d \to 0$ fixed point (analogous to
higher experimental strain rates). The size of the avalanche at time t,
$S_t$, is a stochastic variable which mimics the avalanche dynamics of

Figure 3 | The avalanche oscillator mechanism and stochastic modelling of
the slip susceptibility. a, As the rate $c_d$ increases (key), for $\bar{S} = 0.1$, large noise
causes $\rho(t)$ of equation (4) to oscillate between $\rho \sim 1$ and small $\rho$, causing larger
exponents. b, The probability distribution of $\rho$. As $c_d$ increases (stronger
relaxation, slower strain rate), any $\rho$ becomes equiprobable. In the inset, we use
$\langle S\rangle_{500}$ calculated for equation (1), averaged with a running window of size 500.
The simulations of equation (1) have the same parameters as in Fig. 2. The


equation (1). When $c_d \ll 1$, $\rho$ increases in small steps towards the fixed
point (Fig. 3a) given by $\rho_f = 1 - \sqrt{\pi}\,a/(2\bar{S})$ with average size $\bar{S}$ (a being
the minimum accessible size).
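The stochastic process of equation (4) is simple enough to simulate directly, which may help in seeing how the susceptibility climbs and then drops. The sketch below is illustrative rather than the authors' code: the function names, the default parameters and the seed handling are assumptions, the mean-field size distribution is taken to be $S^{-3/2}\exp(-S/S_0)$ with $S_0 = a/(1-\rho)^2$ and sizes no smaller than a, and steps that would take the susceptibility to 1 or above are rejected (one of the boundary treatments listed in Methods). Clamping the susceptibility at 0 from below is an extra assumption added only to keep the toy model bounded.

```python
import math
import random


def sample_avalanche_size(rho, a=1.0, rng=random):
    """Draw S >= a from P(S) ~ S^(-3/2) * exp(-S/S0), S0 = a/(1-rho)^2.

    Rejection sampling: propose from the pure power law S^(-3/2) on
    [a, inf) by inverse transform, then accept with probability
    exp(-(S - a)/S0), which imposes the exponential cut-off.
    Requires rho < 1.
    """
    s0 = a / (1.0 - rho) ** 2
    while True:
        u = 1.0 - rng.random()            # uniform in (0, 1]
        s = a / (u * u)                   # power-law proposal
        if rng.random() < math.exp(-(s - a) / s0):
            return s


def simulate_rho(c_d, s_bar, steps, a=1.0, rho0=0.0, seed=0):
    """Iterate rho_{t+1} = rho_t + c_d * (1 - S_t / s_bar)."""
    rng = random.Random(seed)
    rho = rho0
    trajectory = []
    for _ in range(steps):
        s = sample_avalanche_size(rho, a, rng)
        proposal = rho + c_d * (1.0 - s / s_bar)
        if proposal < 1.0:
            # Assumption: clamp at 0 so the toy model stays in [0, 1).
            rho = max(proposal, 0.0)
        # Otherwise the step is rejected and rho keeps its previous value.
        trajectory.append(rho)
    return trajectory


if __name__ == "__main__":
    for c_d in (0.001, 0.5):
        traj = simulate_rho(c_d=c_d, s_bar=20.0, steps=20000, seed=1)
        print(c_d, min(traj[5000:]), max(traj[5000:]))
```

With a small step the trajectory is expected to settle close to the fixed point, while a larger step lets rare large avalanches throw the susceptibility far below it; comparing the printed ranges for the two step sizes gives a quick feel for that contrast.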

However, there is a finite probability of a large avalanche which
takes the system far from the fixed point, with $\Delta\rho_t$ large and negative.
If $\delta S = S_t - \bar{S}$ then $\Delta\rho_t = -c_d\,\delta S/\bar{S} \sim -1$ indicates the emergence of a
novel quasi-periodic behaviour (Fig. 1b) showing large negative jumps
in $\rho$ with rare avalanches whose sizes $S_t$ are much larger than $\bar{S}$. $\rho$
performs a Sisyphean task, constantly ascending towards the original
critical point before the sharp descent after a large rare avalanche. In
this way, the distribution of $\rho$ effectively flattens (Fig. 3b) as $c_d$
increases (low experimental strain rates), leading to larger integrated
exponents (equation (3)). Consistently, the analogous distribution for
equation (1) flattens as R increases (Fig. 3b, inset). The rare $\delta S$ events
scale with $S_0 \propto 4\bar{S}^2/(\pi a)$ and qualitatively, there is a transition when
$c_d \approx a\pi/(4\bar{S}) \sim 1/(\bar{S}/a)$ (Fig. 3c). We call a system undergoing this
qualitative behaviour an 'avalanche oscillator', based on its strong
resemblance to the case of relaxation limit cycle oscillations near a
singular Hopf bifurcation with stochastic perturbations.
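A rough consistency check shows why a flattened susceptibility distribution raises the integrated exponent. The estimate below is a sketch under stated assumptions, not a derivation from the paper: the susceptibility is taken to be exactly uniform on [0, 1], the cut-off is $S_0 = a/(1-\rho)^2$ as in the Markov process above, the weak dependence of the normalization N on the susceptibility is ignored, and $S \gg a$. Substituting $x = 1 - \rho$:

$$P_{\mathrm{int}}(S) \propto \int_0^1 d\rho\; S^{-3/2}\, e^{-S(1-\rho)^2/a} = S^{-3/2}\int_0^1 dx\; e^{-S x^2/a} \approx S^{-3/2}\cdot\frac{1}{2}\sqrt{\frac{\pi a}{S}} = \frac{\sqrt{\pi a}}{2}\, S^{-2}$$

The Gaussian integral contributes an extra factor $S^{-1/2}$, so the mean-field exponent 3/2 becomes 2 once every value of the susceptibility is equally likely, in line with the exponent drift reported for slow strain rates.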

Single microcrystals display a rich collection of novel mechanical
behaviours: together with size effects and the emergence of avalanche
slip events, the importance of often-neglected slow processes on
intermittency has now come to light. The presented experiments at the
microscale now force us to reconsider our understanding of the
macroscopic world, such as disordered solids and earthquake faults. Our
general theory proposes that whenever avalanches compete with other
slow coexisting processes to minimize the local internal stress, the
dynamics will give rise to the self-organized avalanche oscillator.


METHODS SUMMARY 


The experimental measurements were performed using the methodology
described earlier. The data are taken at time resolutions of 5, 50 and 500 Hz
for different samples. The nominal strain rates (in s$^{-1}$) were $10^{-4}$, $10^{-5}$ and $10^{-6}$
with corresponding average platen velocities of 4, 0.4 and 0.04 nm s$^{-1}$, given the
dimensions of the microcrystals. The experimental time series were filtered using
Wiener filtering methods optimized for studying avalanches. In the simulations
of equation (1), Euler time stepping is used to evolve the differential equation on an
L × L grid. During an avalanche, the stress is not increased and the relaxation term
(I) does not participate in the evolution. During the avalanche process, we evolve
the system by using cellular automata rules: when the total local stress crosses its $\sigma_f$
threshold, the associated local slip $\phi$ increases randomly with a normal
distribution of mean 1 and variance 1. The assumption of strict positivity in the local
slip is used for simulation efficiency purposes, without affecting our conclusions.
In the stress of equation (1), we have also added a term for regularizing purposes
that slightly smooths the slip profiles. It takes the form $\alpha\nabla^2\phi$ with very small

histograms, shown in the appropriate scale ($1 - 1/\langle S\rangle = \rho$ for the kernel used)
show qualitatively similar flattening behaviour as equation (4). c, The novel
regime ('integrated quasi-periodic') with large $\rho$ fluctuations is separated from
the traditional regime $\rho \sim \rho_f$. The line $c_d \propto 1/\bar{S}$ is shown, as described in the text.
$\tau$ was calculated using equation (3) at equidistant points with a final Gaussian
interpolation for the colour background.


$\alpha = 0.05$. In our simulations, we used a flat distribution ranged in (0,1] for the
quenched disorder $\sigma_f(\mathbf{r})$, following a typical protocol. The kernel $K(\mathbf{r})$ has a
continuum Fourier representation $K(\mathbf{k}) = -C k_x^2 k_y^2/(k_x^2 + k_y^2)^2$, where we set C = 1
for clarity in our analysis. In the simulations of equation (4), the stochastic
equation was solved using random variables that follow the required power-law
distribution with exponential cut-off, generated with standard rejection methods.
During the numerical solution of equation (4), $\rho$ can jump above 1, a regime we
do not consider (see Supplementary Information).
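For readers who want to experiment with the interaction term, the Fourier form of the kernel makes the internal stress cheap to evaluate on a periodic grid. The following is a minimal sketch, assuming NumPy, a periodic L × L lattice with unit spacing and K(0) = 0 (so that a uniform slip produces no internal stress); the function names `edge_kernel` and `internal_stress` are hypothetical and none of these discretization choices is taken from the paper.

```python
import numpy as np


def edge_kernel(L, C=1.0):
    """K(k) = -C kx^2 ky^2 / (kx^2 + ky^2)^2 on an L x L periodic grid.

    The k = 0 mode is set to zero (an assumption of this sketch).
    """
    k = 2.0 * np.pi * np.fft.fftfreq(L)          # discrete wavenumbers
    kx, ky = np.meshgrid(k, k, indexing="ij")
    k2 = kx**2 + ky**2
    K = np.zeros((L, L))
    mask = k2 > 0
    K[mask] = -C * kx[mask] ** 2 * ky[mask] ** 2 / k2[mask] ** 2
    return K


def internal_stress(phi, K):
    """Convolve the slip field with the kernel via FFT (periodic boundaries)."""
    return np.real(np.fft.ifft2(K * np.fft.fft2(phi)))


if __name__ == "__main__":
    L = 16
    K = edge_kernel(L)
    phi = np.random.default_rng(0).random((L, L))   # arbitrary slip field
    sigma_int = internal_stress(phi, K)
    print(sigma_int.shape, float(K.min()))
```

The kernel is never positive and is bounded below by -C/4, reached along the diagonals of k-space, so in this form it always acts against the local slip pattern rather than reinforcing it.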


Full Methods and any associated references are available in the online version of
the paper.

METHODS


Experimental. The data are taken at time resolutions of 5, 50 and 500 Hz for different
samples, depending on the case. The nominal strain rates were $10^{-4}$, $10^{-5}$ and
$10^{-6}$ s$^{-1}$ with corresponding average platen velocities of 4, 0.4 and 0.04 nm s$^{-1}$,
given the dimensions of the pillars. Optimal Wiener filtering corresponds to a
low-pass filter that has significant effects only at short timescales, which are plagued by
apparatus problems. In a similar fashion to ref. 24, we performed tests to
confirm that the power laws and the long-time quasi-periodic behaviour
were not related to the filtering procedure.

Theoretical. In the simulations of equation (1), during diffusion, Euler time
stepping is used to evolve the differential equation on an L × L grid. During an
avalanche, given that $\epsilon \to 0$, the stress is not increased and the relaxation term (I)
does not participate in the evolution. This approximation was made for
clarity, with results qualitatively similar to the case $\epsilon = 1$. In that case,
the effect of diffusion is more visible and avalanches dissipate (for large D) a much
smaller stress percentage, since the relaxation term dominates the behaviour.
During the avalanche process, we evolve the system by using cellular automata
rules: when the total local stress crosses its $\sigma_f$ threshold, the associated local slip $\phi$
increases randomly with a normal distribution with mean 1 and variance 1. The
assumption of strict positivity in the local slip is used for simulation efficiency,
without affecting our conclusions, as we demonstrate in Supplementary
Information. In the stress of equation (1), we have also added a regularizing
term that slightly smooths the slip profiles. It takes the form $\alpha\nabla^2\phi$
with very small $\alpha = 0.05$. We have checked for several system sizes (up to $64^2$) that
this term does not affect our reported results in any visible manner. Also, we note
that this term is physically motivated, inasmuch as it is connected to the
coarse-grained form of the stress generated by dislocation pile-ups. In our simulations we
used a flat distribution ranged in (0,1] for the quenched disorder $\sigma_f(\mathbf{r})$, following a
typical protocol. The kernel $K(\mathbf{r})$ has a continuum Fourier representation
$K(\mathbf{k}) = -C k_x^2 k_y^2/(k_x^2 + k_y^2)^2$ (ref. 10), where we set C = 1 for clarity
in our analysis. A modification of C modifies the strength of disorder required in
the model in order to observe avalanche behaviour, with no other changes. In all
simulations using equation (1) (unless explicitly mentioned otherwise) the hard- 
ening coefficient k is selected from the formula k = 2L°*°/S) where we chose 
$S_0$ = 1,000 with $S_0$ being approximately equal to the cut-off size of the distribution
that is derived for D = 0. The reason for this choice is that the
nature of the kernel is such that a fixed local hardening coefficient k does not set
the cut-off for the size distribution. Rather, it allows for a weak increase with the
system size. However, for our purpose (studies of D > 0) it was crucial to have well
controlled critical distributions for D = 0, independent of the system size, to
identify the concrete effects of the relaxation on the distributions. In all plots we


refer to the value of R = D/c. We note that all our conclusions remain qualitatively
unaltered if a strain-rate-controlled test is considered, while the only requirement
we identified for the emergence of the avalanche oscillator is the existence of a large
range of time intervals between distinct events, as self-similarity requires. The
independence of the avalanche oscillator behaviour from the external forcing type
is in contrast to typical microscopic friction stick-slip or coarse-grained
weakening modelling, which lead to stick-slip avalanche periodicity and typical infinite
off-critical events. In the simulations of equation (4), the stochastic equation was
solved using random variables that follow the required power-law distribution
with exponential cut-off, generated with the standard rejection method. While
solving equation (4) numerically, $\rho$ can jump above 1, a regime that we do not
consider. There are several options to deal with the boundary condition at $\rho = 1$
which are numerically very similar for large $\bar{S}$ and small $c_d$. After a jump which
takes $\rho > 1$: (1) $\rho$ is reset to a random value between 0 and 1; (2) $\rho$ is reset to a
specific value (for example, 0 or $\rho_f$); (3) $\rho$ is returned to its previous value and the
step is rejected (this method was used for the generation of Fig. 3 centre). We
reiterate that these crossings ($\rho > 1$) are regularization/finite-size effects and do
not define the system's behaviour at long times and in the limit of $\bar{S}/a \to \infty$, as we
verified in both simulations of equation (1) (showing that the distribution 'bump'
consistently vanishes with the system size) and equation (4) (showing that
different treatments of the $\rho = 1$ boundary lead to the same conclusion and phase
boundary $c_d \propto 1/\bar{S}$). Finally, in Fig. 2, the fitting functional forms used were


$$c_0\,S^{-\tau}\exp\left[-c_1\,(S/S_0)^{c_3} + c_2\sqrt{S/S_0}\right]$$

where $c_0$, $c_1$, $c_2$, $c_3$ and $\tau$ are fitting parameters. As it
appears from our theoretical study, the cut-off functions $f(S/S_0)$ are rate
dependent. For example, in Fig. 2c, we find $c_3 = 3/2$ at the $10^{-4}$ rate, while it is
$c_3 = 1$ at $10^{-6}$. In Fig. 2d, $c_3 = 2$ at the low rate, while it is $c_3 = 1$ at the high one.


Forming all-carbon quaternary stereogenic centres
in acyclic systems from alkynes 


Yury Minko, Morgane Pasco, Lukas Lercher, Mark Botoshansky & Ilan Marek


The formation of all-carbon quaternary stereocentres in acyclic
systems is one of the most difficult contemporary challenges in
modern synthetic organic chemistry. Particularly challenging is
the preparation of all-carbon quaternary stereocentres in aldol
adducts; this difficulty is problematic because the aldol reaction
represents one of the most valuable chemical transformations in
organic synthesis. The main problem that limits the formation of
these stereocentres is the absence of an efficient method of
preparing stereodefined trisubstituted enolates in acyclic systems. Here
we describe a different approach that involves the formation of
two new stereogenic centres (including the all-carbon quaternary
one) via a combined carbometalation-oxidation reaction of an
organocuprate to give a stereodefined trisubstituted enolate. We
use this method to generate a series of aldol and Mannich products
from ynamides with excellent diastereomeric and enantiomeric
ratios and moderate yields.

In the past few decades, stereoselective synthesis has played a crucial
role in organic chemistry, leading to the rapid development of myriad
widely used synthetic transformations. One of the remaining
significant challenges in this field is the development of strategies for the
enantioselective creation of carbon atoms bonded to four different
carbon substituents, particularly in acyclic systems. Several attractive
methods based on asymmetric catalysis have therefore been developed
in recent years. However, the current limitation of all of these
asymmetric approaches is that they combine only two components,
resulting in the formation of a single carbon-carbon bond in the critical step.
Consequently, attention has focused on how to synthesize quaternary
stereocentres efficiently, in a minimum number of steps, with
optimum convergence, with as little functional group transformation as
possible, few or no by-products and maximum atom efficiency.
Simply put, we need to accomplish more with less, making use of new
reactions or concepts. The number of steps in the synthesis is of
paramount importance. This challenging goal of step economy can
be achieved by using reactions that allow a great increase in complexity,
or through one-pot operations that string together many steps that
collectively achieve the same high complexity increase. This last
approach is indeed generally beyond our reach, particularly when
addressing challenging issues such as the creation of all-carbon
quaternary stereocentres in acyclic systems.

For the past few years, we have concentrated our efforts on the
creation of such quaternary stereocentres. We found that acyclic
homoallylic alcohols possessing the desired all-carbon quaternary
stereocentre could be formed with excellent diastereomeric and
enantiomeric ratios from simple alkynes, through successive in situ
synthetic transformations with important stereochemical implications.
We turned our attention to reactions that form aldol products, as
they represent one of the most versatile carbon-carbon bond forming
processes available to synthetic chemists, and the formation of
all-carbon quaternary stereogenic centres at the α-position to carbonyl
groups remains elusive and has been the focus of only four independent
studies, first in racemic and then in enantiomerically enriched
forms. A different approach that avoids the issues associated with
enolate geometry is the use of silyl ketene imines that successfully led
to aldol surrogates. In all of these examples, only a single
carbon-carbon bond is formed in the reaction sequence between two
components, and as discussed earlier, issues of efficiency and convergence
have to be addressed. Although aldol adducts could be formed, easy
and efficient access to stereodefined trisubstituted enolates (1 in
Fig. 1) as reactive intermediates could drastically increase the scope of
this chemistry. Indeed, enolates are involved in a multitude of powerful
applications in asymmetric organic synthesis, but the generation of
fully substituted acyclic enolates in geometrically defined form
requires resident chirality on the enolizable carbon centre (Fig. 1).
The E/Z selectivity of these trisubstituted enolates is therefore
determined either by the stereochemistry and geometry of the starting
bicyclic lactam (path A in Fig. 1), by the configuration of the
diastereomerically pure α-methylbutyramide (path B), or by a matched
combination of the chirality of the ester substrates and lithium amide bases
(path C).

Our initial approach to tackling the problem of formation of aldol 
adducts possessing a quaternary stereocentre was based on the concept 
we developed for the formation of stereodefined homoallylic alcohols 
through a combined carbometalation—zinc homologation—allylation 
reaction, followed by acidic hydrolysis of the resulting enamides’. 
Despite the success of chiral oxazolidinone imides (Evans’ method)” 
for the incorporation of adjacent stereocentres in aldol reactions, the 
asymmetric construction of all-carbon quaternary carbon stereocen- 
tres is hampered, as discussed previously, by the lack of E/Z selectivity 
in the enolization reactions of acyclic oxazolidinone imides (2 in 
Fig. 2). However, owing to the attractiveness of these auxiliaries, we 
reasoned that the selective formation of trisubstituted enolates 1 from 
oxazolidinone-based chiral auxiliaries would constitute the best 
approach to solve the problem of formation of all-carbon quaternary
centres not only in aldol reactions, but also in reactions with different
electrophiles. So, the real challenge is to prepare such species through
the concomitant formation of several bonds in a single-pot operation
from simple alkynes.

Figure 1 | General strategies for the formation of stereodefined enolates.
Synthesis of geometrically defined trisubstituted enolate 1 requires substrates
possessing a stereodefined enolizable sp³ carbon centre, such as bicyclic
thioglycolate lactams (path A), diastereomerically pure methyl-α-ethyl
disubstituted pseudoephedrine amides (path B), or structurally matched
enantiomers of the ester and chiral amide base (path C).

As we have recently reported the regioselective carbocupration of 
ynamides 3 (ref. 23; easily obtained by a coupling reaction between an 
alkyne and the oxazolidinone)™, the formation of stereodefined eno- 
lates 1 could logically result from an oxidation reaction of the resulting 
vinyl copper species 4, as shown in Fig. 2. However, when traces of 
oxygen are present, instantaneous degradation (that is, dimerization) 
of organocopper species 4 through single-electron transfer to dioxygen 
is observed”. So oxidation of organocopper requires a different ap- 
proach, meaning a different mechanistic oxidation pathway. On the 
basis of our recent work on zinc homologation of vinyl copper with zinc 
carbenoid’*'’, we thought that oxenoid species (oxo-analogues of car- 
benoids) would be good candidates****. 




Figure 2 | Proposed approach. The applicability
of the aldol reaction is limited by the lack of
selective enolization of α,α-disubstituted carbonyl
compound 2, which results in a mixture of
diastereomeric products. Here we propose an
alternative retrosynthetic analysis for the
formation of aldol-type products 5 and Mannich-
type adducts 6. It would consist of a carbocupration
reaction of ynamide 3 yielding the vinyl copper
species 4. Then, subsequent stereospecific
oxidation should lead to the metal-enolate 1 in situ,
which reacts with an aldehyde to form the aldol
adduct 5. Use of the ynamide possessing the Evans
oxazolidinone as a chiral auxiliary should control
the absolute stereochemistry. Moreover, the
synthetic use of stereodefined trisubstituted enolate
could be easily extended to the addition to a large
variety of electrophiles such as imines for the
creation of various quaternary stereocentres.




Here we describe the development and study of reactions that meet 
these challenges and enable this long-sought synthetic strategy. The 
feasibility of this approach was initially checked by reacting vinyl 
copper 4Cu (Fig. 3, method A, R= n-Bu), easily obtained by carbo- 
cupration of ynamide 3 (R= n-Bu)”, with oxenoid t-BuOOLi (see 
Supplementary Information) at low temperature. We found that the 
oxidation reaction proceeds very cleanly to give the corresponding 
imides 2 after protonation with alcohol in good yields, as shown in 
entries 1 and 2 in Table 1. We assume that this successful transforma- 
tion results from a 1,2-metalate rearrangement”, via the initial forma- 
tion of a heterocuprate 7, leading to the copper enolate 1 (Fig. 3). 

To get information on the stereochemistry of the oxidation reaction, 
copper enolate 1 was trapped with triethylchlorosilane. A single geo- 
metrical isomer was obtained for silyl enol ether 8 (Fig. 3) and the 
stereochemistry was determined by X-ray analysis. The oxidation reac- 
tion indeed proceeds with preservation of the stereochemistry of 


Figure 3 | Combined carbometalation-oxidation sequence. Method A describes the
addition of oxenoid t-BuOOLi to the vinyl copper 4Cu to give the
heterocuprate 7. After 1,2-metalate rearrangement, copper enolate 1 was
formed and could be trapped as silylenol ether 8. Addition of a protic
source leads to the secondary imide 2. Method B describes the addition of
an organocuprate to the ynamide 3 to give the dissymmetric cuprate
4Cu(ate). Addition of t-BuOOH to the reaction mixture leads to
deprotonation of the peroxide by the more basic organometallic species
(that is, alkyl group Me) with formation of 7. The same chemical
pathway then proceeds to give the copper enolate 1. The oxidation
proceeds with a complete preservation of the stereochemical
information resulting from the syn-carbometalation, as determined by
the single isomer of the trisubstituted enol silyl ether 8 (the X-ray crystal
structure of 8 is shown as a stick model).

Table 1 | One-pot formation of imides 2 from ynamides 3

Reaction scheme: ynamide 3 to imide 2. 1) Me2CuLi·LiBr·SMe2, Et2O, −40 °C; 2) t-BuOOH, −80 °C; 3) protic source, −80 °C.

Entry  R               R1    Ynamide  Method  Protic source                                 Product  d.r.*  Yield† (%)
1      Bu              Bn    3a       A       i-PrOH                                        2a       79:21  50
2      Hex             Bn    3b       A       i-PrOH                                        2b       87:13  45
3      Bu              Bn    3a       B       i-PrOH                                        2a       86:14  61
4      Bu              Bn    3a       B       H2O                                           2a       85:15  64
5      Bu              Bn    3a       B       R'OH with two t-Bu groups (drawn structure)   2a       87:13  52
6      Bu              Bn    3a       B       Me2N(CH2)2OH                                  2a       90:10  61
7      Bu              Bn    3a       B       EtNH(CH2)2OH                                  2a       93:7   64
8      Hex             Bn    3b       B       EtNH(CH2)2OH                                  2b       90:10  50
9      Hex             i-Pr  3f       B       EtNH(CH2)2OH                                  2f       92:8   62
10     i-PrCO2(CH2)2   Bn    3c       B       EtNH(CH2)2OH                                  2c       90:10  48
11     AdCO2(CH2)3     Bn    3d       B       EtNH(CH2)2OH                                  2d       93:7   49
12     TIPSO(CH2)3     Bn    3e       B       EtNH(CH2)2OH                                  2e       92:8   58

Methods A and B are given in Fig. 3. Bu, butyl; Hex, hexyl; Bn, benzyl; Pr, propyl; Ad, adamantyl; TIPSO, tri-isopropylsilyl ether.

* Diastereomeric ratios determined by 1H-NMR spectroscopy and by HPLC analysis.
† Isolated yields after column chromatography (based on 3).

the double bond. The diastereomeric ratio of 2 clearly needs to be 
improved, but we wanted first to develop an experimentally simpler 
approach to the formation of the copper enolate 1. Indeed, the original 
method (method A; Fig. 3) requires the preparation of t-BuOOLi in a 
separate reaction vessel at low temperature, followed by a subsequent 
transfer into the vinyl copper species solution at —80°C. To achieve 
this goal, we developed a new method (method B; Fig. 3) in which the 
carbocupration reaction was performed with an organocuprate 
(Me2CuLieMe.S from 2 equiv. of MeLi and 1 equiv. of CuBreMe,S) 
instead of the previously used organocopper (MeCusMe,S). To this 
organocuprate was added 1 equiv. of ynamide 3; only one of the two 
alkyl groups was regio- and stereoselectively added to the triple bond, 
to give the dissymmetric organocuprate species 4Cu(ate). As an alkyl 
organometallic species is more basic than a vinyl metal species, the 
addition of t-BuOOH should lead to an in situ deprotonation of the 
peroxide by the Me group present on the copper, with liberation of 
MeH and concomitant formation of the heterocuprate 7. The latter 
would undergo a 1,2-metalate rearrangement to give the same copper- 
enolate 1 that would finally react with an electrophile (for example, 
protonolysis). Following this new protocol (method B), better yields 
and improved diastereomeric ratios were obtained, as described in 
Table 1 (compare entries 1 and 3). Now that an easy, safe and repro- 
ducible protocol was set, we decided to further improve the diastereo- 
meric ratio, and various protic sources were tested for the hydrolysis of 
the copper enolate 1, as described in Table 1. 

Addition of water or a phenol possessing bulky groups in the ortho
positions did not improve the diastereomeric ratio (Table 1, entries 4
and 5, respectively). On the other hand, addition of amino alcohols such
as N,N-dimethylaminoethanol (Table 1, entry 6) or, even better,
N-ethylaminoethanol (Table 1, entry 7) gives higher ratios in similar
yields. If a different oxazolidinone-based chiral auxiliary is used
(R¹ = i-Pr), ratios and yields are comparable (Table 1, compare entries
8 and 9). Our combined sequence was then applied to various
functionalized ynamides (3c-e) possessing ester (Table 1, entries 10 and
11) or silyl ether groups (Table 1, entry 12), which led to adducts with
good diastereoselectivities, albeit with lower yields in some cases (but
based on the starting ynamide after three consecutive chemical steps).
The absolute configuration of the major isomer was determined by chemical
correlation with authentic samples for 2a and 2b after transformation into
the corresponding carboxylic acids (see Supplementary Information).

Importantly, when trisubstituted enolate 1a was slowly warmed to room
temperature over a period of one hour and stirred at this temperature
for one additional hour, followed by hydrolysis with N-ethylaminoethanol
at low temperature, the adduct 2a was obtained with a similar
diastereomeric ratio. Therefore, enolate 1 shows remarkable
stereochemical stability and no observable isomerization.

Finally, we turned our attention to the development and study of
aldol and Mannich-type adducts to validate this long-sought synthetic
strategy. The sequence is based on our previously optimized protocol,
as described in Fig. 4 and Table 2. Accordingly, the aldol reaction using
the previously described conditions (Me2CuLi, t-BuOOH and R²CHO)
proceeds smoothly to give the aldol adducts 5a-f in good
overall yields (based on the starting ynamides 3) and diastereomeric
ratios (Table 2, entries 1-6). The major diastereoisomer was easily
separated by simple column chromatography on silica gel and the
absolute configuration of the major isomer was determined by X-ray
crystallographic analysis of 5a (Supplementary Information).

The stereochemistry of the major isomer can be rationalized by a
Zimmerman-Traxler transition state (a six-membered ring transition
state adopting a chair conformation, Fig. 4 at top right). The benzyl group
of the oxazolidinone shields one face in the chelated six-membered
ring, and the aldehyde approaches the enolate anti to this group, with
its R² substituent in a pseudo-equatorial position.

The reaction sequence described above allows the preparation of the
aldol adducts 5 possessing the oxazolidinone chiral moiety, with the
creation of three new bonds and two new stereogenic centres, including
the all-carbon quaternary stereocentre, in a single-pot procedure from
easily accessible ynamide 3. It is important to note that this
transformation can be scaled up to larger quantities (that is, 5 mmol of
3) without erosion of the diastereomeric ratios and yields, and that the
experimental conditions are mild enough to avoid retroaldol reactions.
The addition of imines to the stereodefined enolate 1 leads to the
Mannich adducts 6a-c with similarly impressive diastereomeric ratios in
good isolated yields (Fig. 4 and Table 2, entries 7-9). The
stereochemistry of the adduct 6a is consistent with a Zimmerman-Traxler
transition state with approach of the imine from the back face of the
bulky benzyl group of the oxazolidinone. As the barrier of planar
inversion of N-sulphonyl imines (E to Z) is low, steric factors may
account for the formation of the Z-isomer in the six-membered transition
state. Cleavage of the oxazolidinone moiety of 5 could be performed using
standard and reliable high-yielding transformations (with recovery of the
oxazolidinone-based chiral auxiliaries, see Supplementary Information)
and the aldehyde 11a can be obtained diastereomerically and
enantiomerically pure, as shown in Fig. 5.

These results demonstrate that alternative approaches to the preparation
of all-carbon quaternary stereocentres, and more particularly to the
formation of aldol and Mannich adducts, can be designed from simple
alkynes through the concomitant formation of several new bonds in a
single-pot operation. This study illustrates that even the most
challenging problems can be efficiently addressed when new and powerful
synthetic tools are provided. Owing to its versatility, our approach will
surely find a large number of applications for the transformation of
heterosubstituted alkynes into stereodefined trisubstituted enolates that
may subsequently react with a large variety of electrophiles, leading to
a new and general approach to the formation of quaternary all-carbon
stereocentres. This study could be extended to include other auxiliaries
and, more importantly, achiral groups for the development of catalytic
strategies.

Figure 4 | Application to the aldol and Mannich reactions. Shown is a
one-pot reaction en route to aldol and Mannich adducts (5 and 6
respectively) possessing the all-carbon quaternary stereocentre through a
combined carbometalation of 3, oxidation and addition of electrophiles.
Stick models show X-ray crystal structure determination of the absolute
configuration of the major isomer 5a and 6a (see Table 2, entries 1 and
7, respectively). A mechanistic explanation of the reaction is as
follows. An enolate species 1, formed by reaction of 4Cu(ate) with
t-BuOOH followed by 1,2-metalate rearrangement, reacts with the aldehyde
via a six-membered transition state adopting a chair-like conformation.
The oxazolidinone intramolecularly chelates the copper atom, and thus a
second pseudo-heterocycle is formed, with one face shielded by the benzyl
group. The aldehyde approaches the enolate moiety anti to this group with
its R² substituent in a pseudo-equatorial position. When N-sulphonyl
imines are added to the stereodefined enolate 1, the Mannich adduct 6 is
formed through a chair-like transition state where the sulphonylimine
adopts a Z-configuration anti to the bulky substituent of the
oxazolidinone.


Table 2 | One-pot formation of products possessing the all-carbon quaternary stereocentre from ynamides

Conditions (3 to 5 or 6): 1) 1.2 equiv. Me2CuLi·LiBr·SMe2, Et2O, −40 °C;
2) 1.2 equiv. t-BuOOH, −80 °C; 3) 1.2 equiv. R²CHO (for 5) or TsN=CHR²
(for 6), −80 °C, 2 h; then H3O+.

Entry  R    R²             Ynamide  Product  d.r.*     Yield† (%)
1      Bu   Ph             3a       5a       93:2:4:1  50
2      Bu   p-BrC6H4       3a       5b       90:2:5:3  52
3      Bu   p-MeC6H4       3a       5c       90:1:4:5  54
4      Hex  Ph             3b       5d       94:1:2:3  54
5      Bu   p-MeOCOC6H4    3a       5e       93:3:1:3  50
6      Bu   2-Naphthyl     3a       5f       95:2:2:1  62
7      Bu   Ph             3a       6a       93:5:2:0  50
8      Bu   p-MeC6H4       3a       6b       94:3:2:1  58
9      Bu   p-BrC6H4       3a       6c       94:5:1:0  45

5a-f, aldols; 6a-c, Mannich products.
* Diastereomeric ratios determined by 1H-NMR spectroscopy for the four possible diastereomers.
† Isolated yields after column chromatography (based on 3).

Scheme (5a to 11a): 1) (MeO)2CH2, P2O5, CHCl3, 15 min (9a, 93%);
2) LiBH4, MeOH, Et2O, RT, 1 h (10a, 90%); 3) Dess-Martin periodinane,
CH2Cl2, H2O (cat.), RT, 30 min (96%). 11a: d.r. > 99:1, e.r. > 99:1.

Figure 5 | Cleavage of the auxiliaries. Transformation of the aldol adduct
5a into 11a after subsequent protection (step 1), cleavage (step 2) and
oxidation (step 3) reactions. Dess-Martin periodinane is an oxidant. RT,
room temperature; d.r., diastereomeric ratio; e.r., enantiomeric ratio.


METHODS SUMMARY 


General procedure for the one-pot formation of aldol adduct 5a. CuBr·DMS
complex (1.20 mmol, 1.2 equiv.) was placed into a flame-dried three-necked
round-bottomed flask equipped with a magnetic stirring bar and connected to
an argon line. Dry diethyl ether (8.0 ml) was added, and the system was
cooled to −50 °C. A solution of methyllithium (1.60 M in diethyl ether,
2.40 mmol, 2.4 equiv.) was added dropwise, and the reaction mixture was
allowed to warm to −40 °C and stirred for 30 min at this temperature to
form a solution of lithium dimethylcuprate complex. The reaction mixture
was cooled to −50 °C, and a solution of the starting alkynyl carbamate 3
(1.00 mmol, 1.0 equiv.) in 1.5 ml of diethyl ether was added dropwise. The
reaction mixture was allowed to warm to −40 °C and stirred at this
temperature for 1.5 h (monitored by thin-layer chromatography (TLC) using
20% ethyl acetate in hexane as eluent) to form a pale-yellow clear
solution of vinylcuprate. Upon completion, the reaction mixture was cooled
to −90 °C, and a solution of t-butylhydroperoxide in nonane (1.20 mmol,
1.2 equiv.) was added in one fast injection using a syringe. The resulting
reaction mixture was allowed to warm up slowly to −80 °C, and stirred at
this temperature for 1 h to form a green-brown opaque solution. Upon
completion of the oxidation step, freshly purified aldehyde was added at
−80 °C to the reaction mixture, which was stirred for an additional 1.5 h
while keeping the temperature at −80 °C. The reaction was quenched with a
saturated aqueous solution of NH4Cl and allowed to warm up to room
temperature. The phases were separated, and the aqueous phase was
extracted three times with diethyl ether (15 ml). The combined organic
phases were dried over anhydrous Na2SO4, filtered, and concentrated under
reduced pressure. Purification was accomplished by silica gel flash column
chromatography (gradient of 15-30% diethyl ether in hexane; TLC analysis
using 1:1 diethyl ether-hexane).
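
The reagent quantities in the procedure above follow from simple stoichiometry relative to 1.00 mmol of 3. As a minimal sketch for checking them (the helper name `volume_ml` and the dictionary layout are illustrative assumptions, not part of the original procedure), the volume of 1.60 M methyllithium solution that delivers 2.40 mmol can be computed as follows:

```python
def volume_ml(amount_mmol: float, molarity_mol_per_l: float) -> float:
    """Volume (ml) of a solution of the given molarity that delivers amount_mmol."""
    # mmol divided by mol/l gives ml, because 1 mol/l equals 1 mmol/ml.
    return amount_mmol / molarity_mol_per_l


substrate_mmol = 1.00  # ynamide 3, as stated in the procedure

# Equivalents quoted in the procedure, relative to ynamide 3.
equivalents = {"CuBr.DMS": 1.2, "MeLi": 2.4, "t-BuOOH": 1.2}
amounts_mmol = {name: eq * substrate_mmol for name, eq in equivalents.items()}

# Methyllithium is supplied as a 1.60 M solution in diethyl ether.
meli_volume_ml = volume_ml(amounts_mmol["MeLi"], 1.60)

for name, mmol in amounts_mmol.items():
    print(f"{name}: {mmol:.2f} mmol")
print(f"MeLi solution (1.60 M): {meli_volume_ml:.2f} ml")
```

The same helper applies to any titrated stock solution; only the concentration and the number of equivalents change.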

General procedure for the one-pot formation of Mannich adduct 6a. The
preparation of lithium dimethylcuprate complex followed by the
carbometalation and oxidation were performed according to the general
procedure described above. Upon completion of the oxidation step,
1.2 equiv. of tosylimine (0.6 M solution in dry THF) was added to the
reaction mixture at −80 °C. The reaction mixture was stirred for an
additional 2 h while keeping the temperature at −80 °C (progress
monitored by TLC using 8:2 hexane-ethyl acetate as eluent). The reaction
was quenched with a saturated aqueous solution of NH4Cl and allowed to
warm up to ambient temperature. The phases were separated, and the aqueous
phase was extracted three times with diethyl ether (15 ml). The combined
organic phases were dried over anhydrous Na2SO4, filtered, and
concentrated under reduced pressure. Purification was accomplished by
silica gel flash column chromatography (10% ethyl acetate in hexane; TLC
analysis using 8:2 hexane-ethyl acetate).

Recent changes to the Gulf Stream causing
widespread gas hydrate destabilization


Benjamin J. Phrampus & Matthew J. Hornbach


The Gulf Stream is an ocean current that modulates climate in the
Northern Hemisphere by transporting warm waters from the Gulf
of Mexico into the North Atlantic and Arctic oceans. A changing
Gulf Stream has the potential to thaw and convert hundreds of
gigatonnes of frozen methane hydrate trapped below the sea floor
into methane gas, increasing the risk of slope failure and methane
release. How the Gulf Stream changes with time and what effect
these changes have on methane hydrate stability is unclear. Here,
using seismic data combined with thermal models, we show that
recent changes in intermediate-depth ocean temperature associated
with the Gulf Stream are rapidly destabilizing methane
hydrate along a broad swathe of the North American margin.
The area of active hydrate destabilization covers at least 10,000
square kilometres of the United States eastern margin, and occurs
in a region prone to kilometre-scale slope failures. Previous
hypothetical studies postulated that an increase of five degrees Celsius
in intermediate-depth ocean temperatures could release enough
methane to explain extreme global warming events like the
Palaeocene-Eocene thermal maximum (PETM) and trigger widespread
ocean acidification. Our analysis suggests that changes in
Gulf Stream flow or temperature within the past 5,000 years or so
are warming the western North Atlantic margin by up to eight
degrees Celsius and are now triggering the destabilization of 2.5
gigatonnes of methane hydrate (about 0.2 per cent of that required
to cause the PETM). This destabilization extends along hundreds
of kilometres of the margin and may continue for centuries. It is
unlikely that the western North Atlantic margin is the only area
experiencing changing ocean currents; our estimate of 2.5
gigatonnes of destabilizing methane hydrate may therefore represent
only a fraction of the methane hydrate currently destabilizing
globally. The transport from ocean to atmosphere of any methane
released, and thus its impact on climate, remains uncertain.

Methane hydrate, a solid consisting of methane and water, is stable
at high pressures and low temperatures. Owing to a positive thermal
gradient in the Earth, methane hydrate exists only within the first few
hundred metres of sediments in deep marine settings, below which
methane gas and liquid water are stable. Methane hydrates represent
one of the largest reservoirs of organic carbon on Earth. Studies
speculate that destabilization of methane hydrates could inject
significant quantities of methane into the ocean and possibly the
atmosphere, leading to spikes in atmospheric carbon levels.

The base of hydrate stability can sometimes be detected directly in
seismic data via bottom-simulating reflectors (BSRs) that appear as
strong, negative-polarity, high-impedance seismic reflections caused
by free gas at the base of the phase boundary (Fig. 1b). Hydrate
formation is strongly dependent on temperature, and because of this,
the base of the hydrate stability zone, as indicated by BSRs, is used for
estimating subsurface temperature.

Warming waters in the Gulf Stream can potentially destabilize
methane hydrate. Additionally, slight changes in the Gulf Stream flow
direction can also destabilize methane hydrate by introducing warm
waters to regions previously exposed only to cold bottom-water currents.

The Gulf Stream consists of anomalously warm water at depths as great
as 1,000 metres below sea level (m.b.s.l.; Fig. 2a). In regions where the
Gulf Stream is absent, ocean temperatures are markedly colder at
intermediate water depths (300-1,000 m.b.s.l.; Fig. 2a).

Although much is known about the current state of the Gulf Stream,
relatively little is known about the evolution of the Gulf Stream before
the Holocene epoch. Lynch-Stieglitz et al. and Lund et al. suggest a
lower-flux, cooler Gulf Stream during cooler climates. Alternatively,
Winguth et al., using computer simulations, suggest that during warmer
climates, warm ocean temperatures like those of the Gulf Stream prevail,
resulting in methane hydrate destabilization.

Figure 1 | Gas hydrates below the Gulf Stream in the western North
Atlantic. a, Study area, shown boxed in the map-view inset, where the location
of the Gulf Stream is also shown (dashed black lines), flowing along the western
edge of the North Atlantic margin. In the main figure, the grey area denotes
where BSRs exist below the sea floor; the pink area is where methane hydrate is
destabilizing owing to recent changes in ocean temperature; and the
approximate location of the Gulf Stream is between the two solid black arrows.
b, Multi-channel seismic line 80.A is one of several seismic lines in the region
showing clear BSRs shoaling westward along the edge of the continental
margin. The rectangle indicated in white is shown magnified in the inset.
Inset, the gas hydrate phase boundary (that is, BSR) shows as a strong,
negative-polarity reflector (black arrows) that behaves erratically with depth
beneath sea floor shallower than ~1,000 m.b.s.l.

Figure 2 | Evidence for recent changes in ocean temperature and hydrate
destabilization. a, Two different ocean temperature regimes exist in the
western North Atlantic, with Gulf Stream temperatures as much as 8 ± 1.1 °C
(1σ error) warmer at intermediate water depths than ocean temperatures
outside the Gulf Stream. Seismic line 80.A was acquired within the Gulf Stream,
yet the predicted BSR depth using present day (Gulf Stream) ocean
temperatures is shallower than observed BSR depths for sea-floor depths less
than ~1,000 m.b.s.l. (b). Comparison between observed and predicted BSR
depths for the two ocean temperature regimes indicates that cold, non-Gulf
Stream intermediate ocean temperatures produce a much better fit to observed
BSR depths (c). This implies recent ocean warming or northwest intrusion of the
Gulf Stream along the Carolina rise and the onset of methane hydrate
destabilization along the margin. This observation is widespread
(Supplementary Information). Dashed yellow lines indicate where the BSR is
inferred but difficult to identify clearly in seismic data.

Where ocean temperature is well-constrained, comparison between
the predicted and the actual base of methane hydrate stability offers a
valuable means of identifying recent changes in ocean temperature and
sub-sea-floor temperature. Here we analyse the stability of methane
hydrates along the Carolina rise off the east coast of North America
using active-source seismic reflection data, with the goal of
characterizing hydrate stability below the Gulf Stream. The Carolina rise
is a region where both the Gulf Stream and methane hydrate stability are
well-constrained (Fig. 1). Seismic data show clearly observable
BSRs imaged continuously below the sea floor at water depths of
800-4,000 m.b.s.l. throughout the region (Fig. 1b, Supplementary
Information). At water depths shallower than ~1,000 m.b.s.l., the BSR
shoals owing to lower hydrostatic pressures and warmer ocean
temperatures, resulting in a vertically thinner hydrate stability zone
within the sediments (Fig. 1b).

By comparing the predicted base of the hydrate stability zone using
current hydrological and geological conditions with the observed base
of the hydrate stability zone, we determine if differences exist between
the observed and predicted temperature regime. Results show that at
depths greater than ~1,000 m.b.s.l., the model-predicted BSR matches
observed BSR depths in seismic data (Fig. 2b). At shallower depths,
however, model predictions diverge from observations, with the
observed BSR consistently deeper than model predictions. Even after
accounting for end-member uncertainty (to 2σ), the observed BSR is
deeper than model predictions (Fig. 2b).

Many phenomena might explain the difference between observed
and predicted BSR depths, including changing sea level, gas composition,
fluid flow, variable sedimentation rates, changes in heat flow,
seismic velocity model errors, and variations in ocean temperature.
Changing sea level affects hydrate stability by altering the hydrostatic
pressure regime; however, sea level in the early Holocene was lower,
resulting in lower pressure and shallower BSRs. Holocene changes in
sea level cannot therefore explain the anomalously deep BSRs observed
today. A consistent increase in the contribution of thermogenic gas
along the margin could also cause the phase boundary to deepen;
however, geochemical measurements indicate natural gas concentrations
are consistently 99% methane across the region. Fluid flow,
by transporting heat more efficiently via advection, can have a
significant effect on temperature; however, the anomalously deep hydrate
stability zone observed along the edge of the margin requires downward
flow, which is inconsistent with regional fluid models. Rapid
sedimentation can also result in anomalously cool shallow sediment
temperatures. However, sedimentation in this region has not exceeded
30 m Myr−1 since the Pleistocene epoch, and modelling indicates that
the regional sedimentation rate is too low to explain the discrepancy
between observed and predicted BSR depth. Lateral variations in
heat production can also vary heat flow and temperature. Previous
studies across this region suggest no significant variations in heat flow
or radiogenic material across the site. Indeed, an average heat flow
of 19 mW m−2 is necessary to explain the anomalously deep BSR at
the margin edge. This is a factor of two below regional observations.
Finally, anomalously high seismic velocities used to create the seismic
depth section could also result in anomalously deep BSRs. The seismic
velocities necessary to offset observed BSR depths to model predictions
are physically unreasonable; a compression-wave velocity averaging
less than 250 m s−1 in sediments above the BSR is needed to explain
the discrepancy. This leaves ocean temperature variations as the last
plausible explanation for the discrepancy between the observed and
predicted BSR. Specifically, if ocean temperatures recently warmed at
intermediate water depths, it would take time for heat to propagate
downward. Before thermal equilibration, the sediments at the observed
BSR would therefore be cooler than predicted, and the BSR would appear
anomalously deep compared to model predictions.
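
The time needed for a sea-floor warming signal to reach a given sub-bottom depth can be gauged with the standard conductive scaling t ≈ L²/κ, where κ = k/(ρc) is the thermal diffusivity. The sketch below is only an order-of-magnitude illustration and not the 2D model used in this study; it assumes the conductivity, bulk density and specific heat capacity quoted in the Methods Summary, and the function names and the chosen depths are hypothetical.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 3600


def thermal_diffusivity(k_w_m_k: float, rho_kg_m3: float, c_j_kg_k: float) -> float:
    """Thermal diffusivity kappa = k / (rho * c), in m^2 per second."""
    return k_w_m_k / (rho_kg_m3 * c_j_kg_k)


def diffusion_time_years(depth_m: float, kappa_m2_s: float) -> float:
    """Scaling estimate t ~ L^2 / kappa for heat to diffuse over depth_m."""
    return depth_m ** 2 / kappa_m2_s / SECONDS_PER_YEAR


# Sediment properties as quoted in the Methods Summary.
kappa = thermal_diffusivity(k_w_m_k=1.0, rho_kg_m3=1700.0, c_j_kg_k=2500.0)

# Illustrative sub-bottom depths (assumed values, not taken from the paper).
for depth in (50.0, 100.0, 200.0):
    years = diffusion_time_years(depth, kappa)
    print(f"{depth:5.0f} m below sea floor: ~{years:,.0f} years")
```

With these values, a thermal signal takes on the order of a thousand years to penetrate 100 m of sediment, which is why a BSR can lag behind a Holocene change in bottom-water temperature.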

To test whether cooler intermediate-depth ocean temperatures in
the recent past might explain the discrepancy between the observed
and predicted depth of methane hydrate stability along the Carolina
rise, we re-ran the heat-flow model but removed the warming effects of
the Gulf Stream. Instead, we applied as our top boundary condition a
mid-Atlantic temperature-depth profile acquired outside the Gulf
Stream (Fig. 2a). Model results show a remarkably good fit between
observed and predicted BSR depth for all depths (Fig. 2c).

Warming of the intermediate-depth ocean within the Holocene
provides a simple explanation for the discrepancy between the
observed and predicted base of the methane hydrate phase boundary
(Fig. 3). The analysis implies that the Gulf Stream was cooler, lower
flux, or recently diverted northwest during the Holocene, consistent
with previous studies. The result also implies that recent changes
in ocean temperature are causing contemporary destabilization of
methane hydrates along the North American margin.

Analysis of BSRs in other regional seismic data suggests that recent 
ocean bottom warming is ubiquitous along the Carolina rise. In addi- 
tion to line 80.A, we analysed two other published seismic lines acquired 
in the Gulf Stream along the North Atlantic margin that show clearly 
identifiable BSRs (Supplementary Information). Both lines are located
more than 40-50 km away from line 80.A, yet results from all three 
seismic lines are identical. Specifically, analysis of all three seismic 
lines suggests that BSRs located below sea-floor depths shallower than 
~1,000 m.b.s.l. are unstable along the Carolina rise, and that recent 
intermediate ocean temperature warming is widespread along the 
margin just east of North Carolina. 

Other studies support the idea that shifting ocean currents and
significant (>1 °C) Holocene ocean temperature changes occur in this
region. During the twentieth century, the most significant measured
sea surface temperature warming in the North Atlantic Ocean occurred
above the Carolina rise. Ocean surface warming above the Carolina
rise has been attributed to either a northwest shift in the Gulf Stream
flow direction or a warming Gulf Stream, as we suggest. Analysis of
ocean drilling data also indicates recent (Holocene) changes in the Deep
Western Boundary Current along the edge of this margin.

Figure 3 | Timing methane hydrate dissociation along the western Atlantic
margin. To place first-order constraints on the timing of ocean temperature
warming and methane hydrate dissociation along line 80.A, we run a
time-dependent diffusive heat-flow forward model in which we start initially
with our steady-state cold ocean temperature model result (Fig. 2c), but
instantaneously change the ocean temperature boundary condition to present
day (Gulf Stream) values. The resulting forward model shows evolution of
hydrate stability and BSR depth through time. Each coloured line represents a
model-predicted BSR location in the future. These results indicate that
dissociation will continue for the next several thousand years if no change in
ocean temperature occurs. The pink zone represents the total area of
destabilizing methane hydrate.

Extrapolating our observations across the area where the Gulf
Stream and known methane hydrate deposits exist, we place first-order
constraints on the area of hydrate destabilization (Fig. 1). Our analysis
suggests that we are observing the onset of methane hydrate
destabilization along a ~300-km span of the North American margin that
will continue for centuries unless the Gulf Stream shifts southward or
intermediate ocean temperatures cool several degrees (Fig. 3).
We estimate that hydrates are currently destabilizing within a volume
of ~7.5 × 10¹¹ m³. Assuming an average porosity of 60% in the shallow
sediments where hydrates are destabilizing, hydrate filling 5% of the
pore space, and 123 kg of methane per cubic metre of hydrate, we
estimate that ~2.5 Gt of methane, or ~0.2% of that necessary to
explain the PETM, is currently destabilizing beneath a sea-floor area
of ~10,000 km² off the US eastern seaboard. If continuing hydrate
destabilization triggers slope failure at this site, the amount of methane
released could be an order of magnitude greater. Hydrate destabilization,
by converting solid methane hydrate into methane gas and water,
can elevate pore fluid pressure along the slope edge, reducing slope
stability in areas prone to frequent, potentially tsunamigenic slope
failure. It is perhaps no coincidence that the upper headwall
of the Cape Fear slide, the largest submarine slide complex along the
western North Atlantic margin, resides within the area of active
hydrate destabilization (Fig. 1).
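
The methane mass estimate in this paragraph is a product of four quoted quantities, and it can be reproduced as a quick arithmetic check. The sketch below uses only the rounded inputs stated in the text (the variable names are illustrative); because those inputs are rounded, the product lands slightly above the quoted ~2.5 Gt, which is consistent with a first-order estimate.

```python
# Rounded inputs quoted in the text.
destabilizing_volume_m3 = 7.5e11      # bulk sediment volume where hydrate is destabilizing
porosity = 0.60                       # fraction of sediment volume that is pore space
hydrate_saturation = 0.05             # fraction of pore space filled with hydrate
methane_kg_per_m3_hydrate = 123.0     # methane mass per cubic metre of hydrate

KG_PER_GIGATONNE = 1e12               # 1 Gt = 10^9 tonnes = 10^12 kg

# Volume of hydrate = sediment volume x porosity x hydrate saturation.
hydrate_volume_m3 = destabilizing_volume_m3 * porosity * hydrate_saturation

# Methane mass stored in that hydrate, converted to gigatonnes.
methane_mass_gt = hydrate_volume_m3 * methane_kg_per_m3_hydrate / KG_PER_GIGATONNE

print(f"Hydrate volume: {hydrate_volume_m3:.3g} m^3")
print(f"Methane mass:   {methane_mass_gt:.2f} Gt")
```

The result scales linearly with each input, so halving the assumed hydrate saturation, for example, halves the methane mass.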

Recent shifts in Gulf Stream flow or temperature provide a simple
yet powerful mechanism for contemporary methane hydrate dissociation
and carbon release. The analysis presented here provides a
method for constraining Holocene changes in intermediate-depth
ocean temperatures and also demonstrates that slight deviations in
ocean currents have a profound impact on margin stability and the
ocean carbon budget. It is unlikely that the western North Atlantic
margin is the only area experiencing widespread hydrate destabilization
due to changing ocean currents. Recent studies have suggested
that similar ocean temperature shifts may occur both in the Arctic
Ocean and globally along subtropical western boundary currents.
Our estimate of 2.5 Gt of destabilizing methane hydrate may therefore
represent only a fraction of the methane hydrate currently destabilizing
globally.


METHODS SUMMARY 


To test whether methane hydrates below the Carolina rise are stable and tempera- 
tures are in steady state, we developed a 2D finite difference diffusive heat-flow 
model, discretized into 20 m (vertical) X 50 m (horizontal) cells, that accounts for 
conductivity, temperature, bathymetry and depth along depth-converted seismic 
line 80.A”*”°. To constrain ocean temperature with depth, we average nine conduc- 
tivity-temperature-depth (CTD) measurements near the site (Fig. 2a). We estimate 
heat flow at this site using the method outlined in ref. 17 that calculates heat flow 
from deep-water (>2,000 m.b.s.l.) BSRs located below the Gulf Stream thermocline. 
For thermal conductivity, we use a value of 1 W m⁻¹ K⁻¹, consistent with regional 
measurements’’”°”*. We calculate an average heat flow of 40.7 ± 4.6 mW m⁻² (2σ), 
consistent with regional conductive heat-flow measurements and borehole thermal 
data collected near this site'’*°”’. 

With conductivity, sea-floor temperature, sea-floor shape and heat flow con- 
strained, we calculate the 2D steady-state temperature distribution within the 
marine sediments using a finite-difference diffusive heat flow approach. We calculate 
thermal diffusivity assuming constant thermal conductivity of 1 W m⁻¹ K⁻¹, 
an average bulk density of 1,700 kg m⁻³, and a specific heat capacity of 
2,500 J K⁻¹ kg⁻¹, consistent with drilling results'**°**. We hold heat flow constant 
with time at the bottom boundary and sea-floor temperature constant with time 
and variable with depth, consistent with CTD values. We estimate temperature 
uncertainty by integrating end-member uncertainty values into the model. We 
calculate the base of the hydrate stability zone and location of the BSR by integrating 
heat flow results with the CSMGem hydrate stability program** assuming hydrostatic 
conditions and a constant salinity of 34.9‰, as measured across the region”. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Data preparation. We obtained line 80.A seismic data and navigation files from 
the UTIG marine seismic data repository. The data, originally collected in 1977”, 
were provided as a stacked time section in SEGY format. Original processing of the 
seismic data included demultiplexing, CDP sorting and velocity analysis using sem- 
blance techniques, normal moveout correction, 24-fold stacking, time-variable gain, 
and band-pass filtering. Because the data supplied are post-stack, we used velocity 
analysis of CDP gathers from long-offset regional pre-stack seismic data acquired 
near Blake ridge in 2000 to estimate seismic velocities in the shallow (<500 m.b.s.l.) 
subsurface. Interval velocities at depths <500 m.b.s.l. consistently average just above 
~1,500 m s⁻¹, and we used these interval velocities to convert line 80.A from time to 
depth. Additionally, to convert the seismic section to depth, we incorporated con- 
ductivity-temperature-depth (CTD) data to account for the effects of ocean tem- 
perature and conductivity on sound speed with depth*®. The depth conversion 
enabled us to estimate sea-floor and BSR depth with metre-scale accuracy. 
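
The time-to-depth step can be illustrated with a minimal sketch. It assumes a single constant interval velocity and two-way travel time measured from the top of the interval; the function name is hypothetical, and the CTD-based sound-speed correction for the water column described above is not included.

```python
def twt_to_depth(twt_s, velocity_m_s=1500.0):
    """Convert two-way travel time (s) to thickness (m) at constant velocity.

    The factor 0.5 accounts for the down-and-back path of the seismic wave.
    The default velocity is the ~1,500 m/s interval velocity quoted in the text.
    """
    return 0.5 * velocity_m_s * twt_s


# Example: 0.4 s of two-way time through a 1,500 m/s interval.
thickness_m = twt_to_depth(0.4)
```

With this velocity, one 20 m vertical model cell corresponds to roughly 27 ms of two-way travel time.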

Heat flow model. With the 2D seismic depth profile for line 80.A constrained, we 
create the domain for the 2D conductive heat flow model by picking the depth of 
the sea floor along the profile using standard seismic interpretation software. We 
export sea-floor picks into Matlab, and digitize them with a vertical resolution of 
20 m and horizontal resolution of 50 m. This is the approximate resolution of the 
seismic data and represents the standard grid-size resolution we use to run the 2D 
heat flow model. 

Next, we create a 1,500 × 255 cell thermal conductivity grid (with 50 m × 20 m 
cell dimensions), in which we assign a conductivity value of 1 W m⁻¹ K⁻¹ for all 
depths below the sea floor, consistent with regional measurements’’”°”*. The base 
of the conductivity model is such that the basal boundary locations are greater than 
or equal to 3 km below the sea floor. We calculate thermal diffusivity using the 
conductivity grid by assuming a constant thermal conductivity of 1 W m⁻¹ K⁻¹, 
an average density of 1,700 kg m⁻³ for the sediments and a specific heat capacity of 
2,500 J K⁻¹ kg⁻¹, consistent with drilling results!?”°”*. 
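
As a worked check on these values, thermal diffusivity follows from the standard relation κ = k/(ρc). The sketch below uses only the three numbers quoted above; the function name is hypothetical.

```python
def thermal_diffusivity(k=1.0, rho=1700.0, cp=2500.0):
    """Thermal diffusivity (m^2 s^-1) from conductivity k (W m^-1 K^-1),
    bulk density rho (kg m^-3) and specific heat capacity cp (J K^-1 kg^-1)."""
    return k / (rho * cp)


kappa = thermal_diffusivity()  # about 2.35e-7 m^2 s^-1 for the quoted values
```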

We also create a 1,500 × 255 cell temperature grid. To define the values for the 
initial conditions and the boundary conditions of the temperature grid, we use 
previously published regional heat flow measurements as well as independent heat 
flow calculations from deep BSRs to constrain heat flow across line 80.A‘7?7°**. 
We calculate the regional heat flow using BSRs in the deep water environment 
(>2,000 m.b.s.l.) where little variability in bottom water temperature exists by 
assuming (1) a hydrate stability pressure-temperature curve for pure methane, 
(2) a salinity of 34.9‰ and (3) hydrostatic conditions. We use the CSMGem 
program to constrain hydrate stability for these conditions**. The heat flow value 
we obtain using this method of 40.7 ± 4.6 mW m⁻² (2σ) is consistent with independent 
regional conductive heat-flow measurements and borehole thermal data 
collected near this site’?”°”*. 
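
The BSR-based heat flow estimate can be sketched with Fourier's law for a purely conductive, constant-conductivity sediment column. This is an illustration under stated assumptions, not the authors' code: the function name is hypothetical, and the temperature at the BSR is taken as a given input (in the paper it comes from the CSMGem stability curve at the BSR pressure).

```python
def bsr_heat_flow(t_bsr_c, t_seafloor_c, bsr_depth_mbsf, k=1.0):
    """Conductive heat flow (W m^-2) between the sea floor and the BSR.

    t_bsr_c        -- temperature at the BSR (deg C), assumed to lie on the
                      hydrate stability boundary
    t_seafloor_c   -- bottom-water temperature at the sea floor (deg C)
    bsr_depth_mbsf -- BSR depth in metres below the sea floor
    k              -- thermal conductivity (W m^-1 K^-1)
    """
    return k * (t_bsr_c - t_seafloor_c) / bsr_depth_mbsf


# Hypothetical inputs chosen only to show the units: a 20 K difference
# across 500 m of sediment gives 0.04 W m^-2, i.e. 40 mW m^-2.
q = bsr_heat_flow(23.0, 3.0, 500.0)
```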

We define our initial starting-model temperature grid by first assuming a 1D 
linear temperature gradient (that is, temperature increasing linearly with depth 
below the sea floor), where we use two different ocean temperature regimes (inside 
the Gulf Stream versus outside the Gulf Stream) as the top boundary condition at 
the sea floor for our two different models. To constrain ocean temperatures at the 
sea floor beneath the Gulf Stream, we used nine CTD profiles located near our 
study area obtained from the World Ocean Circulation Experiment (WOCE). 
Using these sites, we found temperature versus depth below the Gulf Stream 
had an average standard deviation of +1.1 °C. For sea-floor temperatures outside 
the Gulf Stream, we used the closest WOCE CTD measurements available near our 
study area outside the Gulf Stream. As an additional cold water test, we used CTDs 
from the tropical eastern Pacific where warm surface waters exist but no Gulf 
Stream exists. CTD temperature-depth profiles for the eastern Pacific and non- 
Gulf Stream Atlantic were consistent to within ~1.5 °C, resulting in minimal 
difference in model results. For each of the models shown in Fig. 2, sea-floor 
temperature is held constant in time but is variable with depth, consistent with 
measured temperature-depth profiles. With the top boundary condition con- 
strained by CTD data, we assume open side boundary conditions, but hold the 
top boundary and basal boundary temperature constant in time. 
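
A minimal sketch of the linear starting model follows, assuming NumPy and a grid whose rows are depth cells and whose columns are horizontal cells. The function and argument names are hypothetical. Cells above the sea floor are simply given the sea-floor temperature, which is a simplification made for this illustration.

```python
import numpy as np


def initial_temperature(seafloor_depth_m, seafloor_temp_c, nz,
                        dz=20.0, q=40.7e-3, k=1.0):
    """Return an (nz, nx) grid with T increasing linearly below the sea floor.

    seafloor_depth_m -- sea-floor depth for each column (m below grid top)
    seafloor_temp_c  -- CTD-derived sea-floor temperature for each column
    q / k            -- geothermal gradient (K m^-1) from heat flow and
                        conductivity, using the values quoted in the text
    """
    z = np.arange(nz)[:, None] * dz                 # depth of each row
    below = np.clip(z - seafloor_depth_m[None, :], 0.0, None)
    return seafloor_temp_c[None, :] + (q / k) * below


# Two columns: sea floor at 0 m (5 deg C) and at 100 m (4 deg C).
T0 = initial_temperature(np.array([0.0, 100.0]), np.array([5.0, 4.0]), nz=11)
```

With q = 40.7 mW m⁻² and k = 1 W m⁻¹ K⁻¹ the gradient is 0.0407 K m⁻¹, or about 41 °C per kilometre.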

With thermal conductivity and temperature at the boundary conditions con- 
strained, we run the 2D finite difference heat flow model. The 2D model assumes a 
time-dependent, conductive heat flow regime with constant diffusivity, and no 
significant in situ generation of heat. It therefore uses the following equation to 
solve for temperature changes in time and space: 


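$$\frac{\partial T(x,y,t)}{\partial t} = \kappa \left( \frac{\partial^2 T}{\partial x^2} + \frac{\partial^2 T}{\partial y^2} \right) \tag{1}$$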


Here, ∂ denotes a partial derivative, T is temperature, t is time, κ is the diffusivity, x is 
position in the x-direction, and y is position in the y-direction. To solve this 
numerically, we use a finite-difference technique where we approximate and 
expand the space derivatives to second order such that: 

$$\frac{\partial^2 T}{\partial x^2} \approx \frac{T^{t}_{x+1,y} - 2T^{t}_{x,y} + T^{t}_{x-1,y}}{\Delta x^2} \tag{2}$$

$$\frac{\partial^2 T}{\partial y^2} \approx \frac{T^{t}_{x,y+1} - 2T^{t}_{x,y} + T^{t}_{x,y-1}}{\Delta y^2} \tag{3}$$

where $T^{t}_{x,y}$ is the temperature at time t and cell position (x, y); $T^{t}_{x+1,y}$ is the 
temperature at time t and cell position (x + 1, y); $T^{t}_{x-1,y}$ is the temperature at time 
t and cell position (x − 1, y); $T^{t}_{x,y+1}$ is the temperature at time t and cell position 
(x, y + 1); and $T^{t}_{x,y-1}$ is the temperature at time t and cell position (x, y − 1). Here, 
dx and dy are approximated as Δx = 50 m and Δy = 20 m, respectively, as defined 
by our grid size. We solve the time dependent heat flow equation using a semi-implicit 
(forward-time centre-space) technique, noting that we can approximate 
the change in temperature with time at any given cell as: 


$$\frac{\partial T(x,y,t)}{\partial t} \approx \frac{T^{t+1}_{x,y} - T^{t}_{x,y}}{\Delta t} \tag{4}$$


where $T^{t+1}_{x,y}$ is the temperature at cell (x, y) at the next time step (t + 1), and Δt is 
the time step. To maintain numerical stability, we set Δt less than Δy²/(2κ). We 
iteratively determine how temperature changes in time and space below the sea 
floor at line 80.A by substituting equations (2), (3) and (4) into equation (1) and 
solving for $T^{t+1}_{x,y}$ such that: 

$$T^{t+1}_{x,y} = T^{t}_{x,y} + \kappa\,\Delta t \left[ \frac{T^{t}_{x+1,y} - 2T^{t}_{x,y} + T^{t}_{x-1,y}}{\Delta x^2} + \frac{T^{t}_{x,y+1} - 2T^{t}_{x,y} + T^{t}_{x,y-1}}{\Delta y^2} \right] \tag{5}$$

To calculate steady-state temperatures for each of the models presented in Fig. 2, 
we ran the heat flow model for 3 million years, using the two different ocean 
temperature regimes for each model. We estimate temperature uncertainty for 
Fig. 2 by re-running the model using end-member (2σ) uncertainty values for both 
ocean temperature and heat flow. 
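
The forward-time centre-space update described above can be sketched in a few lines of NumPy. This is an illustration under simplifying assumptions, not the authors' Matlab code: the domain is a flat rectangle rather than one shaped by bathymetry, the top and bottom rows are held fixed, and the "open" side boundaries are approximated by a zero lateral gradient. All function names are hypothetical. The time step uses the standard two-dimensional stability limit, which is slightly stricter than Δy²/(2κ).

```python
import numpy as np

KAPPA = 1.0 / (1700.0 * 2500.0)  # m^2 s^-1, k / (rho * cp) from the text


def stable_dt(kappa, dx=50.0, dy=20.0, safety=0.9):
    """Time step (s) inside the standard 2D forward-time stability limit."""
    return safety / (2.0 * kappa * (1.0 / dx**2 + 1.0 / dy**2))


def ftcs_step(T, kappa, dt, dx=50.0, dy=20.0):
    """Advance T (rows = depth cells, columns = horizontal cells) one step."""
    Tn = T.copy()
    c = T[1:-1, 1:-1]
    d2x = (T[1:-1, 2:] - 2.0 * c + T[1:-1, :-2]) / dx**2
    d2y = (T[2:, 1:-1] - 2.0 * c + T[:-2, 1:-1]) / dy**2
    Tn[1:-1, 1:-1] = c + kappa * dt * (d2x + d2y)
    # Side columns copy their neighbours (zero lateral gradient), an assumed
    # stand-in for open side boundaries; top and bottom rows stay fixed.
    Tn[1:-1, 0] = Tn[1:-1, 1]
    Tn[1:-1, -1] = Tn[1:-1, -2]
    return Tn


def run(T0, kappa, n_steps, dx=50.0, dy=20.0):
    """Iterate the update n_steps times and return the final grid."""
    dt = stable_dt(kappa, dx, dy)
    T = T0.copy()
    for _ in range(n_steps):
        T = ftcs_step(T, kappa, dt, dx, dy)
    return T
```

A linear temperature profile has zero Laplacian, so it is a fixed point of this update; a perturbed grid with the same boundary values relaxes back towards it, which is a convenient sanity check before running long simulations.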

To calculate the expected temperature change with time shown in Fig. 3, we 
took the steady-state solution for the cold ocean temperature regime as our starting 
model but set the upper boundary condition to warm Gulf Stream temperature 
values. This model therefore assumes an instantaneous ocean temperature 
change. We then ran the model for 10,000 years, outputting temperature results at 
100-year time intervals. 

CSMGem hydrate stability model. To calculate the location where the base of 
hydrate stability (and therefore, the BSR) exists below seismic line 80.A, we integrate 
our heat flow model results with the CSMGem program”. The CSMGem 
program generates precise and accurate hydrate stability phase diagrams that 
account for natural gas composition, concentration of inhibitors of hydrate forma- 
tion (such as salt), and variable pressure and temperature regimes. 

With steady-state temperature constrained below the sea floor via 2D numerical 
modelling results, we determine the locations of hydrate stability below the sea 
floor by estimating pressure at each cell in the model. For pressure, we assume both 
hydrostatic and lithostatic conditions. For hydrostatic conditions, we assume a 
water density of 1,040 kg m⁻³; for lithostatic conditions, we assume an average 
grain density of 2,700 kg m⁻³. We assign both lithostatic and hydrostatic pressure 
values to each cell in the temperature grid. 
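
The two pressure end-members can be sketched as follows. The densities are those quoted above; the gravitational acceleration of 9.81 m s⁻², the neglect of atmospheric pressure and porosity, and the function names are assumptions made for this illustration.

```python
G = 9.81             # m s^-2, assumed value
RHO_WATER = 1040.0   # kg m^-3, from the text
RHO_GRAIN = 2700.0   # kg m^-3, from the text


def hydrostatic_pressure(depth_below_sea_surface_m):
    """Pressure (Pa) of a continuous water column down to the given depth."""
    return RHO_WATER * G * depth_below_sea_surface_m


def lithostatic_pressure(seafloor_depth_m, depth_below_seafloor_m):
    """Pressure (Pa) from the water column plus the overlying sediment,
    using the grain density for the sediment load."""
    return (RHO_WATER * G * seafloor_depth_m
            + RHO_GRAIN * G * depth_below_seafloor_m)


# Example: a cell 100 m below a sea floor lying at 1,000 m water depth.
p_hydro = hydrostatic_pressure(1100.0)
p_litho = lithostatic_pressure(1000.0, 100.0)
```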

To use the CSMGem model to constrain where hydrate forms, we assume pure 
methane gas and a constant salinity of 34.9‰, in accordance with previous studies 
near this site’’*°’. We now have constraints on chemistry, temperature (from 2D 
model results), and pressure (hydrostatic and lithostatic), and can calculate, based 
on CSMGem output results, where hydrate is stable for any given pressure or 
temperature value. 

With the hydrate stability phase boundary known from CSMGem results, we 
interrogate each cell location in the temperature and pressure grids to determine if 
the pressure and temperature conditions are appropriate for hydrate stability at 
each cell. Cells where hydrate is stable are denoted in a new hydrate stability grid 
with the value “1”. At cells where hydrate is unstable, we fill the hydrate stability 
grid with a value of “0”. The program then searches through the hydrate stability 
grid and auto picks the location of BSRs by finding where transitions occur 
between cell values of 1 and 0. The hydrate stability grid therefore produces the 
model results that are overlain on the seismic data in Figs 2 and 3. 
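
The stability grid and the BSR auto-pick described above can be sketched with NumPy. The phase boundary is passed in as a function; the placeholder used in the example is a made-up curve for demonstration only and is not CSMGem output. All names are hypothetical.

```python
import numpy as np


def stability_grid(T, P, dissociation_temp):
    """Return 1 where hydrate is stable and 0 where it is not.

    T, P              -- temperature and pressure grids of the same shape
    dissociation_temp -- function giving the phase-boundary temperature for
                         a pressure array (from a stability program)
    """
    return (T < dissociation_temp(P)).astype(int)


def pick_bsr(stable, dz=20.0):
    """Depth (m below grid top) of the first 1 -> 0 transition per column.

    Columns with no transition return NaN.
    """
    drop = np.diff(stable, axis=0) == -1   # True where a 1 sits above a 0
    has_bsr = drop.any(axis=0)
    first = drop.argmax(axis=0)            # index of the first True per column
    return np.where(has_bsr, (first + 1) * dz, np.nan)


# Placeholder boundary for demonstration only (NOT a real hydrate curve).
demo_boundary = lambda P: 10.0 + 0.0 * P

T_demo = np.array([[4.0, 4.0, 12.0],
                   [8.0, 11.0, 13.0],
                   [12.0, 14.0, 15.0]])
P_demo = np.zeros_like(T_demo)
bsr_depth = pick_bsr(stability_grid(T_demo, P_demo, demo_boundary))
```

Casting to a signed integer type matters here, because the transition test relies on the difference between adjacent cells being −1.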
The origin of the terrestrial noble-gas signature 


Svyatoslav S. Shcheka¹ & Hans Keppler¹ 


In the atmospheres of Earth and Mars, xenon is strongly depleted 
relative to argon, when compared to the abundances in chondritic 
meteorites”. The origin of this depletion is poorly understood* "’. 
Here we show that more than one weight per cent of argon may be 
dissolved in MgSiO3 perovskite, the most abundant phase of Earth’s 
lower mantle, whereas the xenon solubility in MgSiO3 perovskite is 
orders of magnitude lower. We therefore suggest that crystallization 
of perovskite from a magma ocean in the very early stages of Earth’s 
history concentrated argon in the lower mantle. After most of the 
primordial atmosphere had been lost, degassing of the lower mantle 
replenished argon and krypton, but not xenon, in the atmosphere. 
Our model implies that the depletion of xenon relative to argon 
indicates that perovskite crystallized from a magma ocean in the 
early history of Earth and perhaps also Mars. 

The relative abundance of noble gases in Earth’s atmosphere is 
largely controlled by the early evolution of our planet. Most of the 
argon in Earth’s atmosphere is the isotope ⁴⁰Ar, which is a radioactive 
decay product of ⁴⁰K. However, the abundances of ‘primordial’ (non-radiogenic) 
noble-gas isotopes also differ among the Sun, the terrestrial 
planets and chondritic meteorites'*. Both chondritic meteorites and 
terrestrial planets are strongly depleted in the light noble gases helium 
and neon compared to the Sun. This feature can readily be explained by 
the loss of light noble gases to space in a relatively weak gravitational 
field. However, compared to chondrites, which are believed to be 
similar to the material from which the terrestrial planets formed, xenon 
is strongly depleted on Earth and Mars relative to argon and krypton. 
This depletion cannot be explained by loss to space, because xenon, the 
heaviest of all noble gases, should be least affected by this process. 

Therefore, a number of alternative hypotheses has been proposed to 
account for the ‘missing xenon’. One class of models assumes that 
xenon is present on Earth in its expected chondritic ratio relative to 
argon and krypton and that the xenon is hidden in some near-surface 
reservoir, such as glacier ices, clathrates, sediments or silicates*®. 
However, such a xenon-rich reservoir has never been found in nature. 
Another class of models has suggested that xenon may be concentrated 
in Earth’s core, either in solid solution with iron or possibly as a separate 
phase”®. Recent experiments, however, did not find any evidence for 
dissolution of xenon in iron at core pressures, although the full range of 
pressures and temperature relevant to core formation has not yet been 
explored’. A third class of models explains the terrestrial noble-gas 
abundance by a combination of hydrodynamic escape to space and 
mantle degassing, but these models do not provide a simple physical 
explanation for the depletion of xenon relative to argon in the atmo- 
sphere either*”®. Alternative hypotheses include the delivery of volatiles 
to Earth by comets that may be depleted in xenon” or the preferential 
loss of xenon to space owing to its low ionization potential'*”’. 

The lower mantle is the largest reservoir on Earth, with a mass exceed- 
ing 50% of the mass of the entire planet. Surprisingly, the potential role of 
the lower mantle in producing the terrestrial noble-gas signature has 
never been investigated. We have therefore studied the solubility of 
noble gases (argon, krypton, and xenon) in MgSiO3 perovskite, the 
major phase of the lower mantle. Although it is widely believed that 
noble gases are very poorly soluble in upper-mantle minerals”, there 
are structural considerations suggesting that this may be different for 


the perovskite phase. MgSiO3 perovskite may contain abundant oxy- 
gen vacancies'**!. These oxygen vacancies may occur charge-balanced 
by Si vacancies at low silica activity, such as in equilibrium with the 
ferropericlase (Mg,Fe)O phase in the lower mantle. The substitution of 
Al³⁺ for Si⁴⁺ may also produce oxygen vacancies. Given that the 
atomic radius of argon is close to the ionic radius of the 
O²⁻ ion, oxygen vacancies in perovskite may provide suitable storage 
sites for noble-gas atoms of appropriate radius. 

Experimental results on noble-gas solubility in MgSiO3 perovskite 
are compiled in Supplementary Table 1. The experiments with argon 
yielded MgSiO3 perovskite with more than 1 wt% of argon, whereas 
the xenon solubility was nearly 300 parts per million (p.p.m.), even if 
excess xenon was clearly present during the experiment (Fig. 1a). Data 
for krypton overlap with the argon data, but tend to be somewhat 
lower. The high argon concentrations are a characteristic property of 
the perovskite phase. This can clearly be seen in Fig. 1b, which shows 
a sample in which argon-saturated MgSiO3 majorite (with garnet 
structure) coexists with MgSiO3 perovskite. No argon is found in the 
majorite (detection limit 150 p.p.m.), while the coexisting perovskite 
contains about 3,000 p.p.m. of argon. In some samples argon solubility 
is so high that the distribution can be visualized by X-ray mapping 
(Fig. 1c and d), although the instability of perovskite under the electron 
beam limits the quality of the images that may be obtained. Again, 
these maps show that the high argon solubility is restricted to the 
perovskite phase. In the course of this study, numerous phases were 
produced by accident because the experiments did not reach the perovskite 
stability field. In none of these phases, which included MgSiO3 
ilmenite, SiO2 stishovite, Mg2SiO4 ringwoodite, and MgO periclase, 
was any argon ever detected. Careful study of the perovskite samples 
by scanning and transmission electron microscopy did not reveal any 
impurities or foreign phases, such as gas bubbles, that may account for 
the observed argon content. We therefore conclude that the argon 
must be incorporated into the crystal lattice of the perovskite phase. 
This is consistent with some preliminary single-crystal X-ray diffrac- 
tion data. The Ar-bearing perovskite shows some additional, weak 
reflections that are consistent with the symmetry of the phase, but 
do not occur in argon-free samples. Subtle, but systematic differences 
in reflection intensity were also observed. A detailed structure refine- 
ment was not possible owing to the large width of the X-ray reflections. 
However, the broadening of the reflections (0.5° Ω half width as opposed 
to 0.07° for pure MgSiO3 perovskite; MoKα radiation) indicates some 
internal strain that itself is suggestive of argon being incorporated in the 
crystal lattice of the perovskite. Moreover, in some samples, high argon 
contents appear to correlate with low Si/Mg ratios. This is consistent 
with argon being incorporated in oxygen vacancies, which are charge- 
balanced by Si vacancies. 

If argon primarily substitutes for oxygen vacancies in perovskite, the 
low solubility of xenon is probably related to the size mismatch between 
xenon and the oxygen vacancies. The atomic radius of argon (1.64 Å; 
ref. 22) is close to the radius of O²⁻ (1.28-1.4 Å), but xenon with an 
atomic radius of 1.96 Å is probably too large to fit into an oxygen 
vacancy. Our data therefore suggest that noble gases of suitable atomic 
radius can easily be incorporated in minerals that have appropriate 
vacancies. Such vacancies exist in lower-mantle perovskite’**', but 

Figure 1 | Scanning electron microscope images of MgSiO3 perovskite 
samples saturated with noble gases. a, Perovskite crystallized in the presence 
of excess xenon. The bright spots in the image are xenon-filled bubbles in 
perovskite below the sample surface. However, in those parts of the perovskite 
crystals that are far away from bubbles, no xenon can be detected, implying very 
low xenon solubility (<100 p.p.m.). b, Argon-saturated sample with perovskite 
(Pv, arrows) coexisting with MgSiO3 majorite (garnet structure, Maj). Dark 
spots are holes previously filled by argon gas. No argon can be detected in 
majorite (detection limit 150 p.p.m.), but the perovskite contains 3,000 p.p.m. 
of argon. c, Another sample, containing perovskite (lighter areas) coexisting 
with Mg2SiO4 ringwoodite (Rw, dark) and MgSiO3 ilmenite (Ilm, dark). 
d, X-ray map of argon distribution in the same sample as shown in c. Argon is 
only found in the perovskite phase. The concentration scale shown corresponds 
to the range from 0 (blue) to approximately 0.8 wt% (green) of Ar. 


not in the minerals of the upper mantle; the behaviour of noble gases in 
the lower mantle therefore fundamentally differs from that observed in 
upper-mantle minerals. Noble-gas solubilities in silicate perovskite 
observed in this study reach the level of weight per cent, implying 
that these phases are actually a new kind of noble-gas compound. 
The dependence of noble-gas solubility on the size of the noble gas 
atom can be well described by a lattice strain model (Fig. 2). The model 
predicts that helium and particularly neon should also be highly 
soluble in perovskite. 
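
The size dependence can be illustrated numerically. The sketch below assumes the lattice-strain expression in its widely used Brice form, c = c₀ exp{−4πE N_A [r₀/2 (r − r₀)² + (r − r₀)³/3] / (RT)}, which the text does not spell out. It uses the fit parameters given in the Fig. 2 caption (E = 35 GPa, r₀ = 1.4 Å, c₀ = 1.53 wt%) and the Ar and Xe radii quoted in the text. The krypton radius of 1.78 Å and the choice of 1,800 °C are assumptions made for this example, and the function name is hypothetical.

```python
import math

N_A = 6.02214076e23   # Avogadro constant, mol^-1
R_GAS = 8.314462618   # gas constant, J mol^-1 K^-1


def solubility_wt_pct(radius_angstrom, temp_k,
                      e_gpa=35.0, r0_angstrom=1.4, c0_wt_pct=1.53):
    """Solubility (wt%) predicted by a Brice-type lattice-strain model."""
    r0 = r0_angstrom * 1e-10
    d = radius_angstrom * 1e-10 - r0
    strain_energy = (4.0 * math.pi * e_gpa * 1e9 * N_A
                     * (0.5 * r0 * d**2 + d**3 / 3.0))   # J mol^-1
    return c0_wt_pct * math.exp(-strain_energy / (R_GAS * temp_k))


T_K = 1800.0 + 273.15
radii = {"Ar": 1.64, "Kr": 1.78, "Xe": 1.96}   # Kr value is an assumption
sol = {gas: solubility_wt_pct(r, T_K) for gas, r in radii.items()}
ratios = {gas: sol[gas] / sol["Ar"] for gas in sol}
```

Under these assumptions the Xe:Kr:Ar solubility ratios come out close to 0.03:0.3:1, with xenon suppressed by well over an order of magnitude relative to argon.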

The idea that noble gases are quite soluble in some minerals with 
appropriate vacancies is supported by an observation that was acci- 
dentally made during the course of this study. In an attempt to obtain 
better single crystals of perovskite for X-ray diffraction studies, some 
experiments were carried out with K2CO3 as the fluxing agent added to 
the starting material. In these experiments, a K-bearing phase was 
observed that also dissolved up to 0.5 wt% of argon (Fig. 3). Further 
investigations showed that this phase had been previously described as 
‘phase X’. Structure refinements of phase X (ref. 23) suggest that it 
always contains vacancies on the K site, with a general formula of 
K₂₋ₓMg₂Si₂O₇Hₓ. Given that the ionic radius of K⁺ is almost identical 
to the radius of Ar, the high argon solubility observed in this phase is 
not surprising. Another observation in these experiments is that the 
argon solubility in perovskite coexisting with phase X is low. This may 
be due to preferred partitioning of Ar into phase X or to high silica 
activity, because in some of these experiments SiO2 stishovite was 

Figure 2 | A lattice-strain model”’ of noble-gas solubility in MgSiO3 
perovskite. The fit is based on Ar, Kr and Xe solubility data for 25 GPa at 
1,600 °C to 1,800 °C. Estimated values for the solubilities of He and Ne are also 
shown. Fit parameters are the effective Young’s modulus E = 35 GPa; radius 
of the oxygen site r₀ = 1.4 Å, and strain-free solubility c₀ = 1.53 wt%. Noble-gas 
radii are from ref. 22. Error bars are one standard deviation. For further details, 
see Supplementary Information. 


detected as a separate phase. The perovskite in these experiments is 
therefore silica-saturated and no vacancies on the Si⁴⁺ and O²⁻ sites 
are expected. 

Our results offer a simple explanation for the depletion of xenon 
relative to argon observed in Earth’s atmosphere. Very early in Earth’s 
history, our planet was probably completely molten. This magma 
ocean probably coexisted with an early atmosphere containing volatiles 
released upon melting of chondritic materials as well as possibly some 
component of the nebular gas from which the Sun had formed”. 
Crystallization of perovskite from this magma ocean would then have 
concentrated argon and krypton, but much less xenon in the lower 
mantle. At the same time, most of the primary atmosphere was lost 
by hydrodynamic escape due to high surface temperatures, intense 
ultraviolet radiation from the early Sun and meteorite impacts”’°. 
The signature of mass-dependent fractionation due to hydrodynamic 
escape can still be seen today in the relative abundance of xenon 
isotopes in Earth’s atmosphere’. After solidification of the magma 
ocean, mantle convection transported material from the lower mantle 
close to Earth’s surface, where degassing occurred that replenished 
Earth’s atmosphere in primordial argon and krypton, but much less 
in xenon. Evidence for this process may have been recently observed’’, 
as atmospheric xenon trapped in 3.5-billion-year-old fluid inclusions 
appears to be intermediate in isotopic composition between modern 
atmospheric and chondritic xenon. This could imply continued 
degassing of the lower mantle in the Archaean era. 

Recent simulations” suggest significant radial mixing in the Solar 
System during the late stages of accretion, so that volatile-rich material 
from the outer parts of the Solar System was incorporated into the 
growing Earth. We therefore assume that most of the terrestrial volatiles 
were supplied by chondritic material in the main stage of accretion. 
During the crystallization of the magma ocean, most volatiles were 
enriched in the residual melt, so that at the latest stages of crystallization, 
the saturation point for volatiles may have been reached. This probably 
happened in the lower mantle, because water solubility in the lower- 
mantle minerals is exceedingly small”. If the noble gases mix nearly 
ideally with the other gases in the residual gas phase, the mineral/fluid 
partition coefficients of the noble gases will be proportional to the 
measured solubilities. This means that the ratios of Xe:Kr:Ar dissolved 
in lower-mantle perovskite will correspond to the original ratio of these 
gases multiplied by factors of 0.027:0.32:1, the latter being the ratio of 
the solubilities of Xe, Kr, and Ar in perovskite as derived from the fit in 
Fig. 2. 
Figure 3 | A sample containing phase X (phX) coexisting with MgSiO3 
ilmenite (Ilm), SiO2 stishovite (St) and K2CO3 carbonate. a, Scanning 
electron microscope image. b, X-ray map of argon distribution; argon is only 


A numerical model of hydrodynamic escape induced by heavy 
impacts suggests that only 15% of the initial xenon, 6% of the initial 
krypton and 0.8% of the initial argon will be retained”. If such a highly 
fractionated residual component is mixed with noble gases outgassed 
from the lower mantle, such that the present-day xenon is one-third a 
residual and two-thirds an outgassed component, the resulting ratio of 
Xe:Kr:Ar in the atmosphere would be 0.041:0.32:1, relative to the 
primordial ratios of these gases. The predicted depletion of xenon 
relative to krypton and argon is in good agreement with observation. 
Moreover, the present-day xenon would contain a third of a heavily 
fractionated component, in agreement with the observed mass- 
dependent fractionation of xenon isotopes. However, atmospheric 
krypton and argon would have nearly primordial isotopic composi- 
tion, containing only a trace of fractionated gas (1.7% for Kr and 0.07% 
for Ar), in agreement with the essentially chondritic ratios of argon and 
krypton isotopes in Earth’s atmosphere. The only shortcoming of this 
model is that it predicts a small (factor of three) fractionation between 
argon and krypton. However, this fractionation may disappear with 
increasing pressure in the lower mantle (see Supplementary Informa- 
tion). Both argon and krypton are very compressible, which could 
change their atomic radii in the deeper part of the lower mantle such 
that no relative fractionation occurs. The precise ratio of the two noble- 
gas components invoked in this model essentially depends on the 
degree of xenon isotope fractionation induced by hydrodynamic 
escape, which itself is a function of assumed bulk atmospheric com- 
position, shock temperature and other parameters”*. Additional pro- 
cesses may be required to explain the relative depletion of the heaviest 
isotopes of xenon’. 
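
The two-component mixing arithmetic in this paragraph can be reproduced directly. The sketch uses only numbers quoted in the text (retained fractions of 15%, 6% and 0.8%; solubility ratios of 0.027:0.32:1; two-thirds of present-day xenon outgassed). Variable names are hypothetical, and all abundances are expressed relative to the primordial ratios.

```python
# Fraction of each primordial gas retained after hydrodynamic escape.
retained = {"Xe": 0.15, "Kr": 0.06, "Ar": 0.008}
# Relative abundances in gas outgassed from lower-mantle perovskite.
outgassed_pattern = {"Xe": 0.027, "Kr": 0.32, "Ar": 1.0}
xe_outgassed_share = 2.0 / 3.0   # share of present-day Xe that is outgassed

# Scale the outgassed component so that it supplies two-thirds of the Xe.
scale = (retained["Xe"] * xe_outgassed_share / (1.0 - xe_outgassed_share)
         / outgassed_pattern["Xe"])

total = {g: retained[g] + scale * outgassed_pattern[g] for g in retained}
atmos_ratio = {g: total[g] / total["Ar"] for g in total}
fractionated_share = {g: retained[g] / total[g] for g in total}
```

This gives Xe:Kr:Ar of about 0.04:0.33:1 and fractionated shares of about 1.7% for Kr and 0.07% for Ar, matching the values quoted above to within rounding.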

The ‘planetary’ noble-gas pattern with a depletion of xenon relative 
to argon is also seen on Mars’. This may seem surprising, because the 
mantle of Mars probably contains only a thin layer of silicate perovskite. 
However, the present atmosphere of Mars is much thinner than on 
Earth and accordingly, a much smaller degree of mantle degassing 
may be sufficient to produce the characteristic depletion of xenon rela- 
tive to argon. Moreover, the xenon isotopes on Mars show no mass- 
dependent fractionation, which in our model would imply that the 
primordial atmosphere had been lost completely, in line with the low 
gravitational field of Mars. We therefore suggest that the depletion of 
xenon relative to argon is diagnostic for the fractionation of perovskite 
from a magma ocean in the early history of a planet. 


METHODS SUMMARY 


Experiments on noble gas solubility in MgSiO3 perovskite were carried out in a 
multi-anvil apparatus. MgSiO3 glass containing small amounts (<1 wt%) of Al2O3 
was loaded together with 40-80 bar of argon or xenon into platinum capsules 
using a special gas-loading device. These capsules were first run at 10 GPa and 
1,200 °C for one hour to crystallize MgSiO3 enstatite from the glass. The capsules 
were then recovered and re-run without opening at 25 GPa and 1,600-1,800 °C for 
one hour to convert the enstatite into perovskite. This two-stage procedure is 
necessary because in a one-stage run, the conversion of low-density glass into 


present in phase X. The concentration scale shown corresponds to the range 
from 0 (blue) to approximately 1 wt% (red) of Ar. 


high-density perovskite would cause a pressure drop such that the sample moves 
out of the perovskite stability field. Capsules that did not contain excess gas after 
the experiment were discarded. The other capsules were opened and polished. 
Phases were identified by Raman spectroscopy and chemical compositions were 
measured by electron microprobe (major elements and Ar; JEOL JXA-8200) and 
laser-ablation inductively coupled plasma mass spectrometry (ICP-MS) (Kr and 
Xe; Elan DRC-e quadrupole mass spectrometer attached to a Geolas M 193-nm 
ArF Excimer Laser system). Noble-gas-saturated glasses were used as standards for 
Ar, Kr and Xe. 


Full Methods and any associated references are available in the online version of 
the paper. 


Received 31 May; accepted 8 August 2012. 
Published online 10 October 2012. 


1. Anders, E. & Owen, T. Mars and Earth—origin and abundance of volatiles. Science 
198, 453-465 (1977). 

2. Pepin, R. O. & Porcelli, D. Origin of noble gases in the terrestrial planets. Rev. 
Mineral. Geochem. 47, 191-246 (2002). 

3. atsuda, J. & Matsubara, K. Noble gases in silica and their implication for the 

errestrial missing Xe. Geophys. Res. Lett. 16, 81-84 (1989). 

4. Sanloup, C. et a/. Retention of xenon in quartz and Earth’s missing xenon. Science 

310, 1174-1177 (2005). 

5. — Sill, G. T. & Wilkening, L. L. Ice clathrate as a possible source of atmospheres of 

errestrial planets. /carus 33, 13-22 (1978). 

6. Wacker, J. F. & Anders, E. Trapping of xenon in ice—implications for the origin of 

he Earth’s noble gases. Geochim. Cosmochim. Acta 48, 2373-2380 (1984). 

7. Jephcoat, A. P. Rare-gas solids in the Earth’s deep interior. Nature 393, 355-358 

(1998). 

8. Lee, K. K. M. & Steinle-Neumann, G. High-pressure alloying of iron and xenon: 

“missing” Xe in the Earth’s core? J. Geophys. Res. 111, BO2202 (2006). 

9. ishio-Hamane, D., Yagi, T., Sata, N., Fujita, T. & Okada, T. No reactions observed in 

Xe-Fe system even at Earth core pressures. Geophys. Res. Lett. 37, 04302 (2010). 

10. Pepin, R.O. On the origin and early evolution of terrestrial planet atmospheres and 

meteoritic volatiles. /carus 92, 2-79 (1991). 

11. Dauphas, M. The dual origin of the terrestrial atmosphere. /carus 165, 326-339 

(2003). 

12. Pujol, M., Marty, B. & Burgess, R. Chondritic-like xenon trapped in Archean rocks: a 

possible signature of the ancient atmosphere. Earth Planet. Sci. Lett 308, 298-306 

(2011). 

13. Marty, B. The origins and concentrations of water, carbon, nitrogen and noble 

gases on Earth. Earth Planet. Sci. Lett. 313-314, 56-66 (2012). 

14. Heber, V.S., Brooker, R.A, Kelley, S. P. & Wood, B. J. Crystal-melt partitioning of 
noble gases (helium, neon, argon, krypton, and xenon) for olivine and 
clinopyroxene. Geochim. Cosmochim. Acta 71, 1041-1061 (2007). 

15. Kojitani, H., Katsura, T. & Akaogi, M. Aluminum substitution mechanisms in 
perovskite-type MgSiO3: an investigation by Rietveld analysis. Phys. Chem. Mineral. 
34, 257-267 (2007). 

16. Lauterbach, S., McCammon, C. A., van Aken, P., Langenhorst, F. & Seifert, F. 
Mössbauer and ELNES spectroscopy of (Mg,Fe)(Si,Al)O3 perovskite: a highly
oxidised component of the lower mantle. Contrib. Mineral. Petrol. 138, 17-26 (2000). 

17. McCammon, C. A. Perovskite as a possible sink for ferric iron in the lower mantle. 
Nature 387, 694-696 (1997). 

18. Navrotsky, A. Mantle geochemistry—a lesson from ceramics. Science 284, 
1788-1789 (1999). 

19. Navrotsky, A. et al. Aluminum in magnesium silicate perovskite: formation, 
structure, and energetics of magnesium-rich defect solid solutions. J. Geophys. 
Res. 108, 2330 (2003). 

20. Stashans, A., Piedra, L. & Briceno, T. Fundamental and excited states of F-type 
centres in MgSiO3 perovskite. Physica B 405, 4350-4354 (2010). 

21. Stebbins, J. F. et al. Aluminum substitution in stishovite and MgSiO3 perovskite:
high-resolution 27Al NMR. Am. Mineral. 91, 337-343 (2006).


25 OCTOBER 2012 | VOL 490 | NATURE | 533 


©2012 Macmillan Publishers Limited. All rights reserved 


22. Zhang, Y. X. & Xu, Z. J. Atomic radii of noble gas elements in condensed phases. Am.
Mineral. 80, 670-675 (1995).

23. Yang, H. X., Konzett, J. & Prewitt, C. T. Crystal structure of phase X, a high pressure
alkali-rich hydrous silicate and its anhydrous equivalent. Am. Mineral. 86,
1483-1488 (2001).

24. Greenwood, R. C., Franchi, I. A., Jambon, A. & Buchanan, P. C. Widespread magma
oceans on asteroidal bodies in the early Solar System. Nature 435, 916-918
(2005).

25. Porcelli, D., Woolum, D. & Cassen, P. Deep Earth rare gases: initial inventories,
capture from the solar nebula, and losses during moon formation. Earth Planet. Sci.
Lett. 193, 237-251 (2001).

26. Morbidelli, A. et al. Source regions and timescales for the delivery of water to the
Earth. Meteorit. Planet. Sci. 35, 1309-1320 (2000).

27. Bolfan-Casanova, N., Keppler, H. & Rubie, D. C. Water partitioning at the 660 km
discontinuity and evidence for very low water solubility in magnesium silicate
perovskite. Geophys. Res. Lett. 30, 1905 (2003).

28. Pepin, R. O. Evolution of Earth’s noble gases: consequences of assuming
hydrodynamic loss driven by giant impact. Icarus 126, 148-156 (1997).




29. Brooker, R. A. et al. The ‘zero charge’ partitioning behaviour of noble gases during
mantle melting. Nature 423, 738-741 (2003).


Supplementary Information is available in the online version of the paper. 


Acknowledgements This work was supported by the German Science Foundation
(DFG, SPP 1236). We thank A. Audetat for measurements of Kr and Xe by laser-ablation
ICP-MS, N. Miyajima for help with TEM studies of the samples and T. Boffa-Ballaran for
preliminary X-ray data of Ar-bearing perovskite.


Author Contributions S.S.S. carried out all experiments and chemical analyses
reported in this paper. H.K. suggested this study. Both authors wrote the manuscript
together.


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 


Readers are welcome to comment on the online version of the paper. Correspondence
and requests for materials should be addressed to S.S.S.
(svyatoslav.shcheka@uni-bayreuth.de).


METHODS 


High-pressure experiments. Experiments on noble gas solubility in MgSiO3
perovskite were carried out in a 5,000-tonne multi-anvil apparatus. MgSiO3 glass
containing small amounts (<1 wt%) of Al2O3 was loaded together with 40-80 bar
of argon or xenon into platinum capsules (2 mm in diameter, 3.5 mm long) using a
special gas-loading device (ref. 30). These capsules were first run at 10 GPa and 1,200 °C
for one hour to crystallize MgSiO3 enstatite from the glass. The capsules were then
recovered and re-run without opening at 25 GPa and 1,600-1,800 °C for one hour
to convert the enstatite into perovskite. This two-stage procedure is necessary
because in a one-stage run, the conversion of low-density glass into high-density
perovskite would cause a pressure drop such that the sample would move out of
the perovskite stability field. Experiments were quenched to room temperature
within a few seconds by shutting off the power. Capsules that did not contain
excess gas after the experiment were discarded. The other capsules were opened
and polished.

Analytical methods. Phases were identified by Raman spectroscopy (Jobin-Yvon
Labram, 632 nm HeNe laser). Concentrations of all elements except Kr and Xe
were measured using an electron microprobe (JEOL JXA-8200). Analytical conditions
were 15 kV accelerating voltage, 15 nA beam current and a beam diameter of
5-20 µm. Counting times for each of the major elements (Si, Al, Mg) were 20 s.
Al-bearing MgSiO3 perovskite with known composition was used as a reference
material. For measuring argon, it was important to minimize the degassing of
perovskite under the electron beam, which was observed in some samples at a
beam current of 15 nA (Supplementary Fig. 1). Therefore, Ar was measured with
an accelerating voltage of 15 kV, a beam current of 5 nA and a counting time of
10 s. This resulted in degradation of the argon detection limit from 40 wt p.p.m. at
15 nA to 140-160 wt p.p.m. at a beam current of 5 nA, but no significant outgassing
was observed. A synthetic, Ar-saturated silica glass was used as the Ar
standard; see the Supplementary Information and Supplementary Fig. 2 for further
details. Krypton and xenon contents of perovskites were determined using a
laser-ablation ICP-MS system consisting of an Elan DRC-e quadrupole mass
spectrometer (Perkin Elmer Instruments) attached to a Geolas M 193-nm ArF
Excimer Laser system (Coherent). Technical details about this system can be found
in ref. 31. Kr-saturated and Xe-saturated synthetic glasses were used as standards
for the noble gases. Details on the standards are given in the Supplementary
Information. Kr- and Xe-bearing MgSiO3 perovskites were ablated with a laser
beam of 30 µm or 50 µm in diameter. The laser was operated at a frequency of
10 Hz and an output energy of 80 mJ, which results in an energy density of about
10 J cm^-2 at the sample surface. The sample chamber was flushed with a mixture of
helium and hydrogen at a rate of 0.3 l min^-1 and 0.5 ml min^-1, respectively, to
which Ar was added on the way to the ICP-MS. During laser-ablation ICP-MS
analysis, the instrument background was measured for 40 s before each analysis,
and the sample was measured for another 30 s. The masses of 82Kr, 83Kr, 84Kr and
86Kr and the isotopes 129Xe, 131Xe and 132Xe for Kr and Xe analysis, respectively,
were measured in steps of 10 ms (Supplementary Fig. 3). Integrated signal intensities
were referenced to synthetic albite glasses according to the routines described
by ref. 32. Calculated element ratios were then converted into absolute values using
SiO2 contents as an internal standard. To avoid mass interference, synthetic
noble-gas-free albite glass and MgSiO3 enstatite were additionally measured. It was found
that the 84Kr and 86Kr signals are strongly affected by some unknown species.
Therefore, only the less-abundant isotopes 82Kr and 83Kr were used in the analyses.
Limits of detection were 16-23 wt p.p.m. for Kr and 3-6 wt p.p.m. for Xe and
largely depended on the purity of gases used (He, H2 and Ar).


30. Boettcher, S.L., Guo, Q. & Montana, A. A simple device for loading gases in high- 
pressure experiments. Am. Mineral. 74, 1383-1384 (1989). 

31. Günther, D., Frischknecht, R., Heinrich, C. A. & Kahlert, H. J. Capabilities of an
Argon Fluoride 193 nm excimer laser for laser ablation inductively coupled
plasma mass spectrometry microanalysis of geological materials. J. Anal. At.
Spectrom. 12, 939-944 (1997).

32. Longerich, H. P., Jackson, S. E. & Günther, D. Laser ablation inductively coupled
plasma mass spectrometric transient signal data acquisition and analyte
concentration calculation. J. Anal. At. Spectrom. 11, 899-904 (1996).

Epistasis as the primary factor in molecular evolution


Michael S. Breen1, Carsten Kemena1, Peter K. Vlasov1, Cedric Notredame1 & Fyodor A. Kondrashov1,2


The main forces directing long-term molecular evolution remain 
obscure. A sizable fraction of amino-acid substitutions seem to be 
fixed by positive selection’, but it is unclear to what degree long- 
term protein evolution is constrained by epistasis, that is, instances 
when substitutions that are accepted in one genotype are deleterious 
in another. Here we obtain a quantitative estimate of the prevalence 
of epistasis in long-term protein evolution by relating data on 
amino-acid usage in 14 organelle proteins and 2 nuclear-encoded 
proteins to their rates of short-term evolution. We studied multiple 
alignments of at least 1,000 orthologues for each of these 16 proteins 
from species from a diverse phylogenetic background and found 
that an average site contained approximately eight different amino 
acids. Thus, without epistasis an average site should accept two- 
fifths of all possible amino acids, and the average rate of amino-acid 
substitutions should therefore be about three-fifths lower than the 
rate of neutral evolution. However, we found that the measured rate 
of amino-acid substitution in recent evolution is 20 times lower 
than the rate of neutral evolution and an order of magnitude lower 
than that expected in the absence of epistasis. These data indicate 
that epistasis is pervasive throughout protein evolution: about 90 
per cent of all amino-acid substitutions have a neutral or beneficial 
impact only in the genetic backgrounds in which they occur, and 
must therefore be deleterious in a different background of other 
species. Our findings show that most amino-acid substitutions have 
different fitness effects in different species and that epistasis pro- 
vides the primary conceptual framework to describe the tempo and 
mode of long-term protein evolution. 

The study of the factors determining the tempo and mode of molecu- 
lar evolution continues to be at the forefront of evolutionary biology. 
Since the inception of the neutral theory’, many studies of the rate of 
molecular evolution have focused on the relative role of selection versus 
genetic drift in the fixation of amino-acid substitutions®. It now seems 
certain that both of these factors, selection and genetic drift, contribute 
to a substantial fraction of all amino-acid substitutions in the course of 
evolution!*. However, the neutral-versus-selective debate on the nature 
of molecular evolution has primarily focused on the short-term effects 
of substitutions that may not provide the framework necessary to 
understand the differences between the functional and selective effects 
of amino-acid substitutions that accumulate in the course of long-term 
evolution. 

An amino-acid substitution that is neutral or beneficial in one gen- 
etic context may be deleterious in another”'’. Such a situation, when 
the fitness effect of one allele state depends on the allele states at other 
loci, is called epistasis”’°®. Both the neutral and selective theories of 
protein evolution provide an accurate framework for understanding 
long-term protein evolution only if amino-acid states in different genetic
contexts have the same effect on fitness, that is, if epistasis is rare. In the 
absence of epistasis, when the fitness effects of all amino-acid states 
are independent of one another, substitutions in different species are 
expected to have similar effects on fitness except in cases where these 
substitutions enable differences in adaptation to environmental conditions.
In that case, if an amino-acid state were found in one species in
a protein sequence that is not directly involved in environmental 


adaptation, such as a housekeeping protein, then the same amino-acid 
state should be acceptable in an orthologous site in a different species. 
However, if epistasis is common then amino-acid substitutions that 
were beneficial or neutral in one species should often be deleterious 
in another. Therefore, unravelling the extent and basis of epistasis may 
be crucial to understanding differences in protein sequences between 
species and long-term protein evolution’’*. At present, studies of the 
differences in the fitness of substitutions in different genetic contexts 
consider specific genes or events'"’°**, and it is unknown what fraction 
of amino-acid substitutions that occur in one species would also be 
acceptable in another species if they were to occur in orthologous sites 
(but see ref. 11). Here we develop an approach to quantifying the 
impact of epistasis in protein evolution and show that the fitness effects 
of most amino-acid substitutions must depend on the genetic context 
in which they occur. 

We obtained sequence data for some of the most widely sequenced 
genes for organelle and nuclear-encoded proteins. The choice of 
genes selected for this study was dictated by four considerations: the 
availability of at least 1,000 unique orthologous sequences from dif- 
ferent species; a well-defined, conserved housekeeping function; the 
absence or a low rate of gene duplication; and conserved sequence with 
few insertions and deletions (indels), leading to an unequivocal multiple 
alignment of the orthologous sequences. Our final data set included 
orthologous sequences from Metazoa, including 13 mitochondrial genes 
and 2 nuclear genes, and the large subunit of ribulose 1,5-bisphosphate 
carboxylase-oxygenase (Rubisco) encoded by the rbcL gene in chloro- 
plast genomes of Viridiplantae. The number of unique sequences from 
different species ranged between 949 and 13,912, with proteins encoded 
in the organelles having more sequences in the alignment (Table 1). The 
sequences were aligned using a version of the T-Coffee algorithm’ 
adapted to align large data sets of up to 100,000 sequences (Methods). 
For each of these alignments, we calculated the amino-acid usage (u),
defined as the number of different amino acids observed per site 
(Table 1). We found that the average usage across all genes in our data 
set was ~9, meaning that in the course of long-term evolution an average 
site accepted approximately half of all possible amino acids. The distri- 
bution of u reveals few invariant sites (those where a single amino acid 
was observed), indicating that the high usage was not caused by an 
average of invariant and extremely variable sites (Supplementary Fig. 1). 
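The usage measure described above can be illustrated with a minimal Python sketch. It assumes the alignment is held as a list of equal-length strings and that gap or ambiguity characters (`-`, `.`, `X`, `?`) are ignored; the function names and the toy alignment are hypothetical and are not taken from the study.

```python
from typing import List

# Assumption: these characters mark gaps or ambiguous residues and are not counted.
IGNORED = set("-.X?")


def amino_acid_usage(alignment: List[str]) -> List[int]:
    """Return, for every alignment column, the number of distinct amino acids seen."""
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    usage = []
    for col in range(length):
        states = {seq[col].upper() for seq in alignment} - IGNORED
        usage.append(len(states))
    return usage


def mean_usage(alignment: List[str]) -> float:
    """Average usage u across all sites of the alignment."""
    per_site = amino_acid_usage(alignment)
    return sum(per_site) / len(per_site)


# Toy alignment of four sequences and four sites (illustrative only).
toy = ["MKTA", "MRTA", "MKSA", "M-TG"]
print(amino_acid_usage(toy))  # [1, 2, 2, 2]
print(mean_usage(toy))        # 1.75
```

A site with usage 1 is invariant in this sketch, so the per-site list can also be used to inspect the distribution of u.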

Amino-acid usage can be used to predict the expected short-term rates 
of protein evolution without epistasis. Without epistasis, an amino-acid 
state observed in one species should also be acceptable in the orthologous 
site of another species. Therefore, substitutions in a protein sequence 
leading to amino-acid states observed in orthologous sites should not 
be inhibited by negative selection. For example, if one-half of amino 
acids were accepted in a protein sequence in the long term then, in a 
non-epistatic model, the same fraction of amino acids should be 
acceptable to this protein in the course of short-term evolution. This 
prediction can be expressed in terms of the ratio of the per-site rates of 
non-synonymous and synonymous evolution® (dN/dS): if on average 
~0.5 of all amino acids were found per site then the expected non- 
epistatic dN/dS ratio between two closely related orthologues of this 
gene should be ~0.5 (Methods). 
Table 1 | Amino-acid usage, frequency of non-fixed states and rates of evolution

Gene     Number of species    Average      Frequency      Average amino-acid    Expected, non-epistatic   Observed        Fraction of
         with non-redundant   amino-acid   of non-fixed   usage after Poisson   dN/dS ratio from          average dN/dS   epistatic
         sequences            usage        state (p)      correction            Poisson correction        ratio           evolution

ATP6     3,02                 9.83         5.41 x 10^-4   9.39                  0.44                      0.056           0.87
ATP8     1,244                12.82        8.24 x 10^-4   11.61                 0.56                      0.224           0.60
COX1     4,450                6.87         7.13 x 10^-4   6.33                  0.28                      0.015           0.95
COX2     4,204                9.70         7.64 x 10^-4   9.12                  0.43                      0.025           0.94
COX3     2,19                 9.43         2.00 x 10^-4   TAS                   0.32                      0.036           0.89
CYTB     7,954                10.70        1.17 x 10^-3   10.68                 0.51                      0.039           0.92
ND1      2,056                9.76         4.99 x 10^-4   8.41                  0.39                      0.040           0.90
ND2      5,963                10.67        1.64 x 10^-3   10.66                 0.51                      0.067           0.87
ND3      2,852                10.54        6.8 x 10+      10.24                 0.49                      0.069           0.86
ND4      2,04                 10.08        5.31 x 10^-4   9.00                  0.42                      0.045           0.89
ND4L     1,785                11.85        497x107        10.24                 0.49                      0.076           0.84
ND5      949                  8.97         4.07 x10°-*    7.10                  0.32                      0.057           0.82
ND6      1,015                11.28        2.88 x 10^-4   8.97                  0.42                      0.073           0.83
EEF1A1   1,743                3.81         5.31 x 10^-4   3.05                  0.11                      0.020           0.82
H3.2     1,228                5.18         8.97 x 10^-4   3.59                  0.14                      0.037           0.74
rbcL     13,912               8.65         1.39 x 10^-3   8.65                  0.40                      0.072           0.82


Under the simplifying assumption that at a site in a large multiple
alignment all permissible amino-acid states are equally likely to be
observed at least once, dN/dS should be equal to (u - 1)/19, where
19 is the number of amino-acid substitutions possible at a site. However, to
apply amino-acid usage as a measure for estimating the expected non- 
epistatic dN/dS value, it is necessary to take into account potential 
biases that are inherent to usage. Specifically, u is sensitive to erroneous 
inclusions of rare amino-acid states because an amino acid that is 
found only once at a site in an alignment makes the same contribution 
to the measure as an amino acid that is found thousands of times. 
Indeed, the majority of the observed amino-acid states in our align- 
ments are rare (Supplementary Fig. 2), which at first glance seems to 
suggest that sequencing errors or deleterious polymorphisms had a 
large influence on u. However, the frequency distribution of fixed 
amino-acid states is also expected to follow an exponential decay func- 
tion. On a realistic phylogeny with many species, the majority of evolu- 
tionary time is found in recent branches, and many amino-acid states 
are therefore expected to be rare because they would have evolved 
recently (Supplementary Fig. 3). Thus, a majority of fixed events are 
also expected to be rare as a direct consequence of the phylogenetic 
nature of evolution and, therefore, a sophisticated method to take into 
account the impact of rare erroneous states is required. 

There are three potential sources of error in our estimate of amino- 
acid usage: alignment errors, sequencing errors and sequence poly- 
morphisms. The latter two we treat as a single confounding factor of 
non-fixed amino-acid states. We used different approaches to deal with 
potential alignment errors and the contribution of non-fixed states. To 
minimize the effect of misaligned sequences, we post-processed the 
alignments by removing the sequences most likely to be misaligned. In 
a progressive alignment procedure, such as the one used here, mistakes 
usually arise through inaccurate indel accumulation. We therefore 
recursively removed the sequences inducing the largest fraction of indels 
and realigned the remaining sequences (Supplementary Table 1). We 
found that the resulting alignments contained few indels, and different 
alignment methods yielded virtually identical amino-acid usage mea- 
sures (Supplementary Table 2). Thus, we do not expect alignment errors 
to contribute to the usage measure of our data set. 

Next we used a probabilistic approach to estimate the contribution 
of non-fixed amino-acid states to the amino-acid usage measure. First 
we estimated the probability of occurrence of a non-fixed state at one 
site in one sequence, p, by calculating the divergence of sequences from 
the same species (Supplementary Methods). This approach combines 
the estimate of the frequency of occurrence of sequencing errors and 
segregating polymorphisms into one measure. Using multiple sequences 
from the same species from a large subset of those species that were used 
in the original multiple alignment (Supplementary Table 3), we estimated
the amino-acid diversity of non-fixed states, $\pi_a$, in the same
manner as the commonly used nucleotide diversity measure. From
$\pi_a$ we then estimated p as $\pi_a/\delta$, where $\delta$ is the average number of
non-synonymous mutations per codon (Methods). We found that p
was generally small, with 0.0002 < p < 0.007 across the genes in our
data set (Table 1). We then calculated the probability of multiple occurrences
of the same non-fixed state at one site in the multiple alignment
using the Poisson approximation of a binomial distribution. The probability
that a non-fixed state with probability of occurrence p is observed
k times in an alignment of N sequences is $m = (pN)^k e^{-pN}/k!$. The probability
that an amino-acid state is observed at least once out of k times in
a fixed rather than non-fixed state is $r = 1 - m$. Thus, at each site the
amino-acid usage measure that takes into account the probability that
some of the states were non-fixed is $\sum_{i=1}^{u} r_i$, where i is the amino-acid
state and u is the amino-acid usage at that site. By applying this approach
to our data, we obtained the amino-acid usage corrected for the contri- 
bution of non-fixed states. This was slightly smaller than the uncorrected 
amino-acid usage (Table 1), indicating that the impact of non-fixed states 
is small. Two different approaches, one probabilistic and one empirical, 
confirm the low impact of non-fixed states on our measurement of 
amino-acid usage (Supplementary Tables 4 and 5). 
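The Poisson correction described in this paragraph can be sketched in Python as follows. This is a minimal illustration, not the authors' code: it assumes N is taken as the number of non-gap residues in the column, it evaluates the Poisson term in log space to avoid overflow for large k, and the function names and example column are hypothetical.

```python
import math
from collections import Counter
from typing import Iterable


def prob_nonfixed(k: int, lam: float) -> float:
    """Poisson probability m = lam^k * exp(-lam) / k! with lam = p * N."""
    if lam <= 0.0:
        return 1.0 if k == 0 else 0.0
    # Log-space evaluation keeps the expression finite for large k.
    return math.exp(k * math.log(lam) - lam - math.lgamma(k + 1))


def corrected_usage(column: Iterable[str], p: float) -> float:
    """Sum of r_i = 1 - m_i over the amino-acid states observed at one site."""
    counts = Counter(aa for aa in column if aa not in "-X")
    n_seqs = sum(counts.values())  # assumption: N = residues present in this column
    lam = p * n_seqs
    return sum(1.0 - prob_nonfixed(k, lam) for k in counts.values())


# Hypothetical column: 1,000 sequences, one common state and two rarer ones.
column = "A" * 990 + "V" * 9 + "T"
print(round(corrected_usage(column, p=5e-4), 4))
```

In this example the state seen once contributes noticeably less than 1 to the corrected usage, whereas the states seen 9 and 990 times contribute essentially 1 each, which is why the correction mainly discounts singletons.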

Our data show that a site in a protein can, on average, accept about 
eight different amino acids in different species over the course of long- 
term evolution. This estimate is undoubtedly a lower bound owing to 
the restricted phylogenetic breadth and quantity of sequences included 
in the alignment. In our data, u tends to plateau as the number of 
sequences included in the alignment is increased; however, the actual 
plateau of this function is apparently beyond the number of sequences 
that were available for our study (Supplementary Fig. 4), confirming 
our estimate of u as a lower bound. The non-epistatic model predicts 
that an amino acid that is acceptable to one species should be accept- 
able to another, such that the expected non-epistatic dN/dS value of 
orthologues from the same alignment should equal (u - 1)/19; in our
case, for amino-acid usage of eight, the expected dN/dS value is 7/19, or 
~0.37. We compared this prediction with the observed dN/dS ratio 
estimated for each of the 16 genes in our data set using many of the 
species present in the multiple alignment (Supplementary Table 6). On 
average, the observed dN/dS ratio was about seven times smaller than 
that predicted by the non-epistatic model (Table 1). 
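The arithmetic behind this comparison can be reproduced with a short Python sketch. The (u - 1)/19 expectation is taken from the text; expressing the fraction of epistatic evolution as one minus the ratio of observed to expected dN/dS is an assumption that is consistent with the values listed in Table 1, and the function names are hypothetical.

```python
def expected_dnds(usage: float, n_alternatives: int = 19) -> float:
    """Expected non-epistatic dN/dS for a mean amino-acid usage u: (u - 1)/19."""
    return (usage - 1.0) / n_alternatives


def epistatic_fraction(observed_dnds: float, expected: float) -> float:
    """Assumed definition: share of the expected rate that is not realised."""
    return 1.0 - observed_dnds / expected


# Usage of eight gives the 7/19 figure quoted in the text.
print(round(expected_dnds(8), 2))  # 0.37

# ATP6 row of Table 1: corrected usage 9.39, observed dN/dS 0.056.
atp6_expected = expected_dnds(9.39)
print(round(atp6_expected, 2))                             # 0.44
print(round(epistatic_fraction(0.056, atp6_expected), 2))  # 0.87
```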

The (u - 1)/19 approximation is a crude estimate of the expected
dN/dS value because it relies on the assumption that in the multiple 
alignment all permissible amino acids are equally likely to be observed. 
However, some amino-acid states may be more than one mutational 
step away from a sequence in which dN/dS is measured, and such 
states should therefore not be taken into account when comparing 
the expected non-epistatic dN/dS ratio with the observed ratio for this 
particular sequence. We thus estimated the expected dN/dS value 
using only those amino-acid states that are one mutational step away 
from the sequence for which dN/dS was measured. We estimated the
non-epistatic dN/dS ratio for these sequences as the average of $u_m/n_m$
across all sites, where $n_m$ is the total number of possible amino-acid
states one mutational step away from the state in a given site and $u_m$ is
the number of such states observed in the entire alignment. The $u_m/n_m$
measure of epistatic evolution was larger than that predicted by our
crude measure (u - 1)/19, resulting in a difference between the
expected non-epistatic dN/dS value and the observed value of more 
than one order of magnitude (Table 2). The higher expected dN/dS 
value in this case can be explained to be the result of closely related 
sequences exploring only a limited part of the sequence space’” or by 
the physicochemical similarity of amino acids that are located in the 
mutational vicinity in the genetic code. 
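The one-mutational-step measure can be illustrated with the Python sketch below. It is a simplification under stated assumptions: the standard genetic code is used for illustration (mitochondrial genes use different codes), stop codons and synonymous changes are excluded from the set of neighbouring states, and all names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (assumption: used here only for illustration).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def one_step_neighbours(codon: str) -> set:
    """Amino acids reachable from `codon` by changing a single nucleotide."""
    current = CODON_TABLE[codon]
    reachable = set()
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            aa = CODON_TABLE[codon[:pos] + base + codon[pos + 1:]]
            if aa != "*" and aa != current:
                reachable.add(aa)
    return reachable


def um_over_nm(codon: str, observed_states) -> float:
    """Fraction of one-step amino-acid states that appear in the alignment column."""
    neighbours = one_step_neighbours(codon)  # n_m states
    if not neighbours:
        return 0.0
    return len(neighbours & set(observed_states)) / len(neighbours)  # u_m / n_m


# Methionine codon ATG; the alignment column is assumed to contain M, L, I and F.
print(sorted(one_step_neighbours("ATG")))  # ['I', 'K', 'L', 'R', 'T', 'V']
print(um_over_nm("ATG", "MLIF"))           # 2 of 6 neighbours observed
```

Averaging `um_over_nm` over all sites of a sequence would give the per-sequence expectation described in the text.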

The large difference between the dN/dS value predicted on the basis of 
amino-acid usage and that observed across a large fraction of the same 
sequences (Supplementary Table 6) indicates that the vast majority of 
amino-acid states are acceptable in a species- or clade-specific manner. 
There are two ways to explain this pattern. First, most amino-acid 
substitutions may have been subject to positive selection for an environ- 
mental adaptation specific to the environment of a particular species. 
Thus, an amino acid that was beneficial to one species because of a 
specific environmental adaptation may be detrimental to a species that 
does not live in the same environment. The second explanation is that 
the fitness of most amino-acid substitutions could have depended on 
the genetic context, or internal cellular environment, of the species in 
which the substitution occurs, such that a substitution beneficial in one 
background would be expected to be detrimental in a different back- 
ground. In other words, epistatic interactions are the norm and not 
the exception when we consider amino-acid substitutions in protein 
sequences. Two considerations indicate that widespread epistasis is the 
most likely explanation for the observed data. First, the McDonald-
Kreitman test' showed that positive selection was not common in the 
evolution of the proteins in our data set (Table 3), as would have been 
expected if the majority of substitutions conferred an environmental 
adaptation. Second, the nature of the functions of the proteins con- 
sidered here, especially that of mitochondrial proteins, implies that 
interaction with the external environment was limited. Additionally, 
numerous examples of genetic interactions within proteins, through 
structural interactions within the same molecule*'"'**°”* or interaction 
of cellular components*’**””~*, support the hypothesis that epistatic 
interactions may affect a large majority of all amino-acid substitutions. 

We identify epistasis as a powerful factor affecting long-term protein 
evolution and one that must necessarily be invoked to explain why the 
vast majority of amino-acid substitutions that occur in one species 
cannot occur in another regardless of whether or not positive selection 
plays the dominant role in the course of fixation of amino-acid sub- 
stitutions in specific genetic contexts. An epistatic perspective of 


Table 2 | Fraction of epistatic evolution measured by $u_m/n_m$

Gene     Number of species     Average expected non-epistatic       Observed average   Average fraction of
         with dN/dS estimate   dN/dS ratio estimated by $u_m/n_m$   dN/dS ratio        epistatic evolution (s.d.)

ATP6     1,300                 0.77                                 0.056              0.93 (0.063)
ATP8     781                   0.52                                 0.224              0.57 (0.303)
COX1     1,123                 0.81                                 0.015              0.98 (0.027)
COX2     1,214                 0.81                                 0.025              0.97 (0.029)
COX3     622                   0.81                                 0.036              0.96 (0.038)
CYTB     3,992                 0.84                                 0.039              0.95 (0.034)
ND1      569                   0.81                                 0.040              0.95 (0.039)
ND2      3,210                 0.79                                 0.067              0.92 (0.042)
ND3      989                   0.69                                 0.069              0.90 (0.069)
ND4      510                   0.83                                 0.045              0.95 (0.033)
ND4L     441                   0.65                                 0.076              0.88 (0.121)
ND5      370                   0.87                                 0.057              0.93 (0.033)
ND6      406                   0.70                                 0.073              0.90 (0.098)
EEF1A1   1,343                 0.77                                 0.020              0.97 (0.018)
H3.2     670                   0.72                                 0.037              0.95 (0.090)
rbcL     13,546                0.88                                 0.072              0.92 (0.093)
Table 3 | McDonald-Kreitman test on select species for each gene

Gene     Average α using      Average α using      Number of species
         polymorphisms with   polymorphisms with   with polymorphism
         >1% frequency        >5% frequency        and divergence
         in the population    in the population    data

ATP6     -0.7 (0.06)          -0.4 (0.08)          52
ATP8     -0.1 (0.16)          -0.1 (0.15)          18
COX1     -0.8 (0.08)          -0.8 (0.08)          25
COX2     -0.7 (0.06)          -0.4 (0.09)          54
COX3     -0.5 (0.18)          -0.6 (0.17)          13
CYTB     -0.8 (0.02)          -0.4 (0.04)          255
ND1      -0.6 (0.15)          -0.5 (0.18)          16
ND2      -0.6 (0.06)          -0.4 (0.06)          77
ND3      -0.4 (0.09)          -0.3 (0.09)          38
ND4      -0.6 (0.11)          -0.6 (0.09)          22
ND4L     -0.6 (0.11)          -0.4 (0.12)          20
ND5      -0.6 (0.17)          -0.7 (0.16)          5
ND6      -0.4 (0.12)          -0.5 (0.11)          25
EEF1A1   NA                   NA                   NA
H3.2     -1.0 (0.04)          -1.0 (0.03)          11
rbcL     -0.3 (0.70)          -1.0 (0.00)          2


The propensity for positive selection, α, was estimated as $\alpha = 1 - (D_s/D_n)/(P_s/P_n)$, where $D_s$ and $P_s$
correspond to the number of synonymous fixed differences and polymorphisms, respectively, and $D_n$
and $P_n$ are the corresponding values for non-synonymous substitutions and polymorphisms. A negative
α value indicates that there were fewer non-synonymous substitutions in evolution than expected given
the number of non-synonymous polymorphisms. Deleterious polymorphisms with a frequency >5%
are expected to be rare, and the presented data therefore suggest a minor role for positive selection in
the evolution of these proteins. The reported values represent averages for all species with
polymorphism data, with s.e.m. shown in parentheses. Such data were not available (NA) for one gene in
our data set.
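As an illustration of the statistic reported in Table 3, the Python sketch below computes α in the standard McDonald-Kreitman form, 1 - (D_s P_n)/(D_n P_s). The function name and the counts in the example are hypothetical and are not data from the study.

```python
def mk_alpha(d_s: float, d_n: float, p_s: float, p_n: float) -> float:
    """McDonald-Kreitman alpha = 1 - (D_s / D_n) / (P_s / P_n).

    d_s, d_n: synonymous and non-synonymous fixed differences.
    p_s, p_n: synonymous and non-synonymous polymorphisms.
    """
    if d_n == 0 or p_s == 0:
        raise ValueError("alpha is undefined when D_n or P_s is zero")
    return 1.0 - (d_s * p_n) / (d_n * p_s)


# Hypothetical counts: relatively few non-synonymous fixed differences
# compared with non-synonymous polymorphisms give a negative alpha.
print(round(mk_alpha(d_s=40, d_n=10, p_s=20, p_n=8), 2))  # -0.6
```

A negative value, as in this toy case, corresponds to the pattern the table legend describes: fewer non-synonymous substitutions than the polymorphism counts would predict.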


molecular evolution leads to the formulation of several fundamental 
questions, in addition to the largely unanswered questions posed by 
John Maynard Smith in 1970 (ref. 12). First, given a specific site, sub- 
stitutions in how many other sites in the same gene or in the entire 
genome could change the strength of selection associated with substi- 
tutions at this site? Second, out of the entire network of pairwise epistatic 
interactions between sites across the genome, are there many non- 
overlapping epistatic subnetworks or are most sites interconnected 
within the entire network of epistatic interactions? Third, what is the 
ratio of intergenic to intragenic epistatic interactions? Fourth, what is 
the molecular basis of epistatic interactions within the genome? Finally, 
pervasive epistasis in long-term protein evolution raises the possibility 
that similar epistatic interactions may be prevalent in short-term evolu- 
tion’’”' and that situations when a polymorphism is benign or bene- 
ficial to one individual but deleterious to another individual within the 
same population may be more common than is thought at present. 


METHODS SUMMARY 


Orthologous sequences for 15 genes from Metazoa and 1 from Viridiplantae were 
aligned using an adapted version of the T-Coffee multiple-sequence aligner. Each 
sequence was encoded as a 20^3-vector where each component was the frequency of
amino-acid words of size 3. The vectors were clustered in 200 groups using the 
k-means algorithm. Each resulting group containing fewer than 200 sequences was 
aligned with the default T-Coffee algorithm, and larger clusters were further 
divided by reapplying the k-means algorithm. The resulting alignments were 
treated as profiles and realigned using the multiple-profile alignment procedure 
supported by the default T-Coffee algorithm. A small subset of sequences, ~14% 
on average across genes, contributed a substantial fraction of gaps to the alignment 
and were removed. Amino-acid usage was measured as the number of different 
amino-acid states across all sites in the final multiple alignment of orthologues. We 
used pairwise sequence comparisons to estimate dN/dS values across genes with 
the codeml program in the PAML package for all pairwise comparisons in the data 
set with 0.05 < dS < 0.5. We then averaged the dN/dS estimates across all species 
with dS < 0.5 and then averaged the dN/dS values across such clusters. 
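The encoding step of the alignment procedure can be sketched in Python as follows. This covers only the conversion of a sequence into a vector of three-residue word frequencies (20^3 = 8,000 components); the k-means clustering and the T-Coffee alignment stages are not reproduced, and the handling of non-standard residues (words containing them are skipped) is an assumption.

```python
from itertools import product
from typing import List

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
WORD_SIZE = 3
# Index of every possible three-residue word: 20**3 = 8000 entries.
WORD_INDEX = {
    "".join(word): i
    for i, word in enumerate(product(AMINO_ACIDS, repeat=WORD_SIZE))
}


def word_frequency_vector(seq: str) -> List[float]:
    """Encode a protein sequence as frequencies of overlapping words of size 3."""
    vec = [0.0] * len(WORD_INDEX)
    total = 0
    for start in range(len(seq) - WORD_SIZE + 1):
        idx = WORD_INDEX.get(seq[start:start + WORD_SIZE].upper())
        if idx is not None:  # assumption: words with non-standard residues are skipped
            vec[idx] += 1.0
            total += 1
    if total:
        vec = [v / total for v in vec]
    return vec


vec = word_frequency_vector("ACDACD")  # words: ACD, CDA, DAC, ACD
print(len(vec))                # 8000
print(vec[WORD_INDEX["ACD"]])  # 0.5
```

Vectors of this kind could then be passed to any k-means implementation to form the sequence groups described in the text.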


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Sequences for our study. We obtained amino-acid and nucleotide sequences of 
the protein-coding genes from GenBank and applied a two-directional best
BLAST hit approach for the initial identification of orthologous sequences.
The phylogenetic breadths of orthologues that were included in our alignments 
were purposefully restricted to improve the reliability of the multiple alignments. 
We took sequences only from Viridiplantae for Rubisco and only from Metazoa 
for all other proteins. We focused on mitochondrial proteins for four reasons. 
First, mitochondrially encoded proteins are very rarely, if at all, duplicated. 
Second, these proteins are expected to have a highly conserved function across 
the Metazoa taxon. Third, mitochondrial genes have been widely sequenced in 
many phylogenetic studies, so we expected to find sequences from thousands of 
different species. Finally, owing to the constraints on the mitochondrial genome
and on mitochondrial gene function, we expected to find few insertions and
deletions in these genes across the Metazoan phylogeny. The same line of reasoning
applied to the inclusion of Rubisco in our data set. In addition, we surveyed
nuclear-encoded genes that were expected to show similar functional
conservation and lack of duplications, with at least 1,000 sequences from
different species. We deliberately avoided most sequences from unicellular
organisms, because environmental adaptation may be relevant to a higher
fraction of genes in such organisms. In multicellular organisms the function of
many genes takes place in more isolated intracellular environments. Only two
nuclear-encoded proteins passed our selection criteria, elongation factor 1-alpha
1 (EEF1A1) and histone H3.2.

Alignment method and verification. Sequences were aligned using KM-Coffee,
an adapted version of the T-Coffee multiple-sequence aligner. In KM-Coffee,
each sequence is encoded as a 20³-vector in which each component is the frequency
of amino-acid words of size 3. The vectors are then clustered in 200 groups using 
the k-means algorithm. Each resulting group containing fewer than 200 sequences 
is aligned with the default T-Coffee algorithm or further divided into smaller 
clusters until each cluster can be aligned by T-Coffee. The resulting multiple 
sequence alignments are then treated as profiles and aligned using the multiple- 
profile alignment procedure supported by the default version of T-Coffee. The 
main advantage of the procedure is its capacity to resolve very large data sets while 
incorporating a consistency-based approach. 
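
The encoding step can be illustrated with a minimal sketch. It assumes overlapping three-residue words and the 20 standard amino acids, and it skips any word containing another character; the text does not specify these details, and the function name is hypothetical rather than part of KM-Coffee.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
# Index every 3-letter word over the 20-letter alphabet: 20**3 = 8,000 words.
WORD_INDEX = {"".join(w): i for i, w in enumerate(product(AMINO_ACIDS, repeat=3))}

def word_frequency_vector(seq):
    """Return the 8,000-component vector of 3-word frequencies for one sequence.

    Assumption: words overlap (a sliding window of step 1), and words with
    non-standard residues are ignored.
    """
    counts = [0.0] * len(WORD_INDEX)
    total = 0
    for i in range(len(seq) - 2):
        idx = WORD_INDEX.get(seq[i:i + 3])
        if idx is not None:
            counts[idx] += 1
            total += 1
    return [c / total for c in counts] if total else counts
```

Vectors of this kind could then be passed to any standard k-means implementation to form the initial groups.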

To remove the sequences most likely to be misaligned, we designed a
post-processing filtering procedure. For each sequence we estimated the indel
contribution index, defined as $\sum_{c} (N - NR)/NR$, where N is the number of
sequences, NR is the number of aligned residues in each column and c is the number of
columns in the alignment. In effect, this measure shares the cost of any gap across
all aligned sequences. We then sorted the sequences by their indel contribution 
index and removed the top fraction of sequences, seeking a trade-off between 
retaining the maximum number of sequences and maximizing the agreement 
between the three large-scale alignment methods. Overall, by applying the indel 
contribution index we removed 13.5% of sequences across all data sets (Sup- 
plementary Table 1). Remaining sequences were then realigned and the resulting 
alignments used for all subsequent analysis. 
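
One possible reading of the indel contribution index is sketched below. It assumes that, for a given sequence, the sum runs over the columns in which that sequence carries a residue, so that the N − NR gaps in a column are shared among the NR sequences occupying it. This interpretation and the function name are assumptions, not a statement of the authors' implementation.

```python
def indel_contribution_index(alignment, gap="-"):
    """Score each aligned sequence by the gaps its residues impose on others.

    alignment: list of equal-length strings. For every column where a
    sequence has a residue, it is charged (N - NR) / NR, where N is the
    number of sequences and NR the number of residues in that column.
    """
    n = len(alignment)
    columns = list(zip(*alignment))
    residues_per_col = [sum(ch != gap for ch in col) for col in columns]
    scores = []
    for row in alignment:
        score = 0.0
        for ch, nr in zip(row, residues_per_col):
            if ch != gap:  # nr >= 1 here, so the division is safe
                score += (n - nr) / nr
        scores.append(score)
    return scores
```

Sorting sequences by this score in descending order and dropping the top fraction would mimic the filtering described above.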

Three methods were available that could align large data sets like the one used 
for this study: MAFFT, Clustal Omega and KM-Coffee. We applied all three to
the filtered data sets and estimated the similarity of their output alignments, using 
the sum-of-pairs method to compare alternative alignments of the same sequences. 
All three alignment methods show high agreement. The average pairwise similarity 
between the KM-Coffee alignments and the Clustal Omega alignments is 97.7%, 
and to the MAFFT alignments it is 97.8%. The correlation between the corrected 
average amino-acid distributions (Supplementary Table 2) was also high (Pearson 
coefficient >0.999 for all three pairs). These results indicate significant robustness 
when comparing KM-Coffee, Clustal Omega and MAFFT. We also measured the 
similarity of different methods when used on the unfiltered set of alignments and 
saw an increase in similarity when using KM-Coffee after filtering relative to 
Clustal Omega (97.7% versus 94.7%) and MAFFT (97.8% versus 93.2%). 
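
A sum-of-pairs comparison of two alignments of the same sequences can be sketched as follows. This is a generic illustration of the idea, scoring the fraction of residue pairs aligned in a reference alignment that are also aligned in a second alignment; it is not the specific tool used in the study, and the function names are hypothetical.

```python
def aligned_pairs(alignment, gap="-"):
    """Return the set of residue pairs placed in the same column.

    Each residue is identified by (sequence index, ungapped position).
    """
    pairs = set()
    positions = [0] * len(alignment)
    for column in zip(*alignment):
        present = []
        for s, ch in enumerate(column):
            if ch != gap:
                present.append((s, positions[s]))
                positions[s] += 1
        for i in range(len(present)):
            for j in range(i + 1, len(present)):
                pairs.add((present[i], present[j]))
    return pairs

def sum_of_pairs_similarity(reference, test):
    """Fraction of residue pairs aligned in `reference` that `test` also aligns."""
    ref = aligned_pairs(reference)
    if not ref:
        return 1.0
    return len(ref & aligned_pairs(test)) / len(ref)
```

The pair enumeration is quadratic in the number of sequences per column, so this form is only practical for small illustrative alignments.
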

Measuring amino-acid usage. To measure the amino-acid usage, we calculated
the number of different amino-acid states across all sites in the final multiple 
alignment of orthologues. We used only sites where more than half of the positions 
were occupied by an amino acid (not a deletion) and took the average across all 
sites in the gene. The number of sites where the amino-acid usage was measured is 
reported in Supplementary Table 2. 
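
The usage measure described above can be sketched in a few lines, assuming the alignment is given as equal-length strings with "-" marking deletions; the function name is hypothetical.

```python
def amino_acid_usage(alignment, gap="-"):
    """Mean number of distinct amino-acid states per site.

    Only sites where more than half of the sequences carry an amino acid
    (rather than a gap) are counted.
    """
    n = len(alignment)
    per_site = []
    for column in zip(*alignment):
        residues = [aa for aa in column if aa != gap]
        if len(residues) > n / 2:
            per_site.append(len(set(residues)))
    return sum(per_site) / len(per_site) if per_site else float("nan")
```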

Using amino-acid usage to estimate the expected dN/dS ratio. The amino-acid 
usage measure, u, can be used to obtain a lower-bound estimate of the expected 
dN/dS ratio under the assumption of no epistasis. The rates of non-synonymous 
(dN) and synonymous (dS) evolution are measures of the number of substitutions 
divided by the number of sites. A crude estimate of non-epistatic dN/dS from u is
(u − 1)/19, where u − 1 is the number of amino-acid states into which the current
amino acid can be substituted and 19 is the total number of possible amino-acid
substitutions at a site. This estimate is based on the assumption that when dN/dS is
measured across many pairwise orthologous comparisons, a
substitution between any of the 20 amino acids is equally likely. In two situations,
the accuracy of this crude estimate improves: when the multiple
alignment from which u is calculated contains thousands of diverged sequences 
from a diverse phylogenetic background, and when dN/dS is measured as an 
average across pairwise comparisons from a large fraction of sequences in the 
entire multiple alignment. An estimate that takes into account the mutational 
neighbourhood of sequences for which dN/dS is measured should be more accurate 
in all instances. Owing to the limitations of taking into account more than one 
substitution at a site, dN/dS is typically measured between closely related sequences.
In this study, we included dN/dS measurements only for cases with 0.05 < dS < 0.5.
Because the average dN/dS in our measurements was ~0.05
(Table 1), this implies that 0.0025 < dN < 0.025. Thus, the probability of two
non-synonymous substitutions occurring at the same site is dN², which implies that
in our measurement of dN/dS we mostly take into account non-synonymous
substitutions just one mutational step away. We therefore also created a measure of the
expected non-epistatic dN/dS taking into account only those amino-acid states that
are one mutation away from the sequence for which dN/dS is measured. For each
sequence for which dN/dS was measured, we calculated $n_m$, the total number of
possible amino-acid states one mutational step away from the sequence, and $u_m$, the
number of such states that were found in the multiple alignment. The estimate of the
expected non-epistatic dN/dS ratio across the entire alignment then becomes an
average of $u_m/n_m$ across all sequences with observed dN/dS.
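
Both estimates reduce to simple arithmetic, sketched below. The function names and the input layout (one pair of counts per sequence) are assumptions made for illustration.

```python
def crude_expected_dnds(u):
    """Crude non-epistatic expectation, (u - 1) / 19, for mean usage u."""
    return (u - 1) / 19

def neighbourhood_expected_dnds(per_sequence_counts):
    """Average ratio of observed to possible one-step amino-acid states.

    per_sequence_counts: iterable of (observed, possible) pairs, one pair
    per sequence for which dN/dS was measured.
    """
    ratios = [obs / poss for obs, poss in per_sequence_counts if poss > 0]
    return sum(ratios) / len(ratios)
```

For example, a usage of about 6 amino acids per site gives a crude expectation of 5/19, roughly 0.26, compared with observed dN/dS values of about 0.05.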

Estimating $\pi_a$ and p. To estimate the expected rate of occurrence of a specific
non-fixed amino-acid state, we first measured the average amino-acid diversity, $\pi_a$.
This measure, analogous to the nucleotide diversity measure, is the fraction of
amino-acid mismatches in a pairwise alignment of two sequences from the same
species. To obtain an average $\pi_a$ value for each gene in our data set, we first
obtained an average $\pi_a$ value for each species with two or more sequences and
then took an average across all species. The number of different species that were
used in the estimate of the average $\pi_a$ value constituted a considerable fraction,
~60% on average, of the total number of species (Supplementary Table 3). There
is no reason to assume that such extensive sampling should be biased towards
species with a higher or lower $\pi_a$ value. The amino-acid diversity is a measure of
the expected density of any non-fixed states. However, for an accurate estimate of
the contribution of non-fixed states, we required a measure for specific states. The
$\pi_a$ measure was obtained from standing variability, such that most of the
segregating polymorphisms were a single mutational step away from the consensus
sequence containing the major alleles. Therefore, the probability of observing a
specific segregating amino acid is $\pi_a$ divided by the total number of possible
non-synonymous mutations away from an average codon. Because on average a codon
is one mutational step away from ~6 codons coding for a different amino acid, we
estimated the expected density of specific amino-acid non-fixed states as $\pi_a/6$.
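
The averaging scheme for amino-acid diversity can be sketched as follows. The sketch assumes pre-aligned sequences of equal length grouped by species and ignores positions where either sequence has a gap; gap handling and all names are assumptions not stated in the text.

```python
from itertools import combinations

def pairwise_mismatch_fraction(a, b, gap="-"):
    """Fraction of mismatched positions between two aligned sequences."""
    pairs = [(x, y) for x, y in zip(a, b) if x != gap and y != gap]
    if not pairs:
        return 0.0
    return sum(x != y for x, y in pairs) / len(pairs)

def amino_acid_diversity(seqs_by_species):
    """Average within-species diversity, then average across species."""
    per_species = []
    for seqs in seqs_by_species.values():
        if len(seqs) < 2:
            continue  # diversity needs at least two sequences
        vals = [pairwise_mismatch_fraction(a, b) for a, b in combinations(seqs, 2)]
        per_species.append(sum(vals) / len(vals))
    if not per_species:
        raise ValueError("no species with two or more sequences")
    return sum(per_species) / len(per_species)

def specific_state_density(diversity, neighbours=6):
    """Expected density of a specific non-fixed state (diversity / ~6)."""
    return diversity / neighbours
```
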

Estimating the impact of non-fixed states. If non-fixed and fixed states have
constant probabilities of occurrence, p and q = 1 − p, respectively, then the
distribution of non-fixed versus fixed states in the multiple alignment is a binomial
distribution. Thus, the probability that a non-fixed state is observed k times in an
alignment of N sequences is $m = C_N^k p^k q^{N-k}$, where $C_N^k$ is the number of possible
combinations of k out of N elements. When N is large, as is the case in our data,
calculating $C_N^k$ directly is a computationally intensive problem, and we therefore
used the Poisson formula, $(pN)^k e^{-pN}/k!$, as a proxy for $C_N^k p^k q^{N-k}$, which gives a
good approximation when pN is small. The number of times a non-fixed
state is expected to be observed at a site, pN, was smaller than one for most genes
(Table 1), justifying the use of the Poisson approximation.
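
The quality of the Poisson approximation can be checked numerically. The sketch below compares the exact binomial probability with the Poisson proxy; the values N = 5,000 and p = 0.0001 are hypothetical and chosen only so that pN is below one.

```python
import math

def binomial_pmf(k, n, p):
    """Exact probability of k successes in n trials with success probability p."""
    return math.comb(n, k) * p**k * (1 - p) ** (n - k)

def poisson_pmf(k, lam):
    """Poisson probability of k events with mean lam (here lam = p * N)."""
    return lam**k * math.exp(-lam) / math.factorial(k)

# Hypothetical illustration values, not taken from the study.
N, p = 5000, 0.0001
differences = [abs(binomial_pmf(k, N, p) - poisson_pmf(k, p * N)) for k in range(4)]
```

With these values the two formulas agree closely for every small k, which is the regime relevant to rarely observed states.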

Independent verification of the minor impact of non-fixed states. To verify 
that the corrected amino-acid usage was not strongly biased by non-fixed amino- 
acid states, we used two approaches that are not based on an estimate of the 
frequency of specific non-fixed amino-acid states. The first approach used a com- 
bination of phylogenetic and probabilistic considerations. Non-fixed states should 
occur relatively randomly on a phylogenetic tree, and the clustering of rare states 
on the phylogeny indicates a high likelihood that such states are fixed states. 
Reducing the number of sequences in the alignment also reduced the probability 
of occurrence of non-fixed states. Thus, we reduced the number of sequences in 
the alignment, to reduce the impact of non-fixed states, while keeping pairs of 
closely related species to increase the probability that each fixed state is found at 
least twice. Owing to the complications involved with reconstructing an accurate 
phylogeny for thousands of species, we used sequence divergence as a proxy of 
phylogenetic distance to select the pairs of sequences from closely related species. 
By reducing the number of sequences in the alignment while keeping pairs of 
sequences from closely related species, it was possible to reduce the expected 
contribution of non-fixed states to almost zero, even for those states that were 
observed a small number of times in the alignment. For each gene in the total data 
set, we selected 200 pairs of sequences that were the most representative of the
amino-acid usage of the entire data set. Using blastclust we clustered all of the
available sequences into 200 clusters, and for each cluster we selected two sequences.
Thus, in the alignment we retained 200 pairs of sequences from phylogenetically
diverse species, and within each pair the two sequences were from close relatives.
For N = 400 and 0.0001 < p < 0.001 (Table 1), the probability that an amino-acid
state that is observed just once is non-fixed is $m = (pN)^1 e^{-pN}/1!$, such that
0.038 < m < 0.27, and the probability that an amino-acid state that has been
observed two or more times is a fixed state is therefore r = 1 − m, such that
0.962 > r > 0.73. Estimating amino-acid usage from this data set taking into
account only those states that have been observed more than once yielded lower 
estimates of average amino-acid usage (Supplementary Table 4), as was expected as 
a result of there being a much smaller number of sequences in the alignments. 
However, the measure was still ~6 amino acids per site, which corresponds to the 
expected dN/dS value being much higher than the observed value (Table 1). 
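
The bounds quoted in this paragraph follow directly from the Poisson formula with k = 1. The worked arithmetic below restates them, together with the crude (u − 1)/19 expectation for a usage of about 6 amino acids per site; it uses only the values given in the text.

```latex
\[
\begin{aligned}
p = 0.0001:&\quad pN = 400 \times 0.0001 = 0.04, & m &= 0.04\,e^{-0.04} \approx 0.038, & r &= 1 - m \approx 0.962,\\
p = 0.001:&\quad pN = 400 \times 0.001 = 0.4, & m &= 0.4\,e^{-0.4} \approx 0.27, & r &= 1 - m \approx 0.73,\\
u \approx 6:&\quad (u - 1)/19 = 5/19 \approx 0.26 .
\end{aligned}
\]
```

An expected dN/dS of roughly 0.26 is about five times the observed average of ~0.05.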

We also used an empirical approach to test the impact of non-fixed states on our 
amino-acid usage measure. We eliminated rare non-fixed states directly, by pro- 
ducing an alignment using consensus sequences for each species with three or more 
sequences. The consensus sequence was constructed such that the amino-acid
state at each site was identical to the most common amino-acid state at that site in
that species. This approach should effectively eliminate most of the rare states that
are not expected to reach fixation (the majority of such states are expected to be
rare). The amino-acid usage estimated using consensus sequences from ~5% of
all available species was, as expected, lower than the amino-acid usage calculated 
using all sequences. However, when we estimated the amino-acid usage from the 
same number of random sequences as there were consensus sequences for each 
gene, we obtained very similar results (Supplementary Table 5). This provides 
independent empirical evidence that the contribution of non-fixed states to 
amino-acid usage is minor. 

Measuring dN/dS and the McDonald-Kreitman test. Because of the
impossibility of reconstructing accurate phylogenies across our entire data set, we used
pairwise sequence comparisons to estimate dN/dS values across genes. We
estimated the dN/dS values using the codeml program in the PAML package for all
pairwise comparisons in the data set and kept only those comparisons for which
0.05 < dS < 0.5, to eliminate unreliable estimates. To minimize the impact of
clades with many closely related species, we first averaged the dN/dS estimates
for all species with dS < 0.5 and then averaged the dN/dS values across clusters of
pairwise comparisons such that dS > 0.5 between any two sequences belonging to
different clusters. For Rubisco, the same procedure was followed except that the
maximum dS cut-off value between sequences was 0.2. The resulting dN/dS estimates
came from a wide phylogenetic background from many non-overlapping clusters
and show small standard deviations (Supplementary Table 6). We applied the
McDonald-Kreitman test in all instances when at least 20 sequences from the
same species were available that had at least five polymorphic sites. All values of
α < −1 were treated as −1.
Phosphorylation of NLRC4 is critical for inflammasome activation

NLRC4 is a cytosolic member of the NOD-like receptor family that
is expressed in innate immune cells. It indirectly senses bacterial
flagellin and type III secretion systems, and responds by assembling
an inflammasome complex that promotes caspase-1 activation and
pyroptosis. Here we use knock-in mice expressing NLRC4 with a
carboxy-terminal 3×Flag tag to identify phosphorylation of
NLRC4 on a single, evolutionarily conserved residue, Ser 533,
following infection of macrophages with Salmonella enterica
serovar Typhimurium (also known as Salmonella typhimurium).
Western blotting with a NLRC4 phospho-Ser 533 antibody confirmed
that this post-translational modification occurs only in the
presence of stimuli known to engage NLRC4 and not the related
protein NLRP3 or AIM2. Nlrc4−/− macrophages reconstituted with
NLRC4 mutant S533A, unlike those reconstituted with wild-type
NLRC4, did not activate caspase-1 and pyroptosis in response to
S. typhimurium, indicating that S533 phosphorylation is critical for
NLRC4 inflammasome function. Conversely, phosphomimetic
NLRC4 S533D caused rapid macrophage pyroptosis without infection.
Biochemical purification of the NLRC4-phosphorylating
activity and a screen of kinase inhibitors identified PRKCD
(PKCδ) as a candidate NLRC4 kinase. Recombinant PKCδ
phosphorylated NLRC4 S533 in vitro, immunodepletion of PKCδ
from macrophage lysates blocked NLRC4 S533 phosphorylation in
vitro, and Prkcd−/− macrophages exhibited greatly attenuated
caspase-1 activation and IL-1β secretion specifically in response to
S. typhimurium. Phosphorylation-defective NLRC4 S533A failed to
recruit procaspase-1 and did not assemble inflammasome specks
during S. typhimurium infection, so phosphorylation of NLRC4
S533 probably drives conformational changes necessary for
NLRC4 inflammasome activity and host innate immunity.

NLRC4 is activated as part of the innate immune response to a range
of intracellular bacteria, including S. typhimurium and Legionella
pneumophila. To explore the details of NLRC4 activation, we created
Nlrc4 3×Flag knock-in mice expressing NLRC4 with a C-terminal 3×Flag
tag (Supplementary Fig. 1a, b). The 3×Flag tag did not compromise
NLRC4 function because Nlrc4 3×Flag knock-in bone-marrow-derived macrophages
(BMDMs), unlike Nlrc4−/− BMDMs, processed caspase-1 and
secreted IL-1β like their wild-type counterparts following infection
with S. typhimurium or transfection with purified S. typhimurium
flagellin (Supplementary Fig. 1c). Nlrc4 3×Flag knock-in BMDMs also exhibited
normal IL-1β secretion in response to ATP and double-stranded
DNA (dsDNA; Supplementary Fig. 1c), stimuli that engage not
NLRC4, but NLRP3 and AIM2, respectively. Mass spectrometry
of NLRC4-3×Flag affinity purified from Nlrc4 3×Flag knock-in BMDMs identified
a single amino acid, Ser 533, as being phosphorylated after infection
with S. typhimurium (Fig. 1a and Supplementary Fig. 2a). Peptides
containing phosphorylated Ser 533 represented nearly a quarter of all
Ser 533-containing peptides (Fig. 1b). Located between the NACHT
domain and C-terminal leucine-rich repeats (LRRs; Fig. 1a), Ser 533 is
conserved in mammals (Fig. 1c). Western blotting with a phospho-Ser 533
NLRC4 antibody (Supplementary Fig. 2b) after
immunoprecipitation of NLRC4-3×Flag confirmed phosphorylation of
Ser 533 in BMDMs infected with S. typhimurium. Significantly,
Ser 533 phosphorylation coincided with NLRC4-dependent processing
of caspase-1 and IL-1β (Fig. 1d).
Figure 1 | NLRC4 Ser 533 is phosphorylated in response to S. typhimurium.
a, Mass spectrometry (MS) of NLRC4-3×Flag from BMDMs infected with
S. typhimurium identified phosphorylation at Ser 533. b, Quantification of
NLRC4 phosphopeptide QEpSIQSLR and its unphosphorylated counterpart.
c, Alignment of NLRC4 orthologues. The depth of purple shading indicates the
degree of residue conservation. Asterisk, mouse Ser 533. d–f, Western blot
(WB) analysis of Nlrc4 3×Flag knock-in BMDMs and their culture medium after infection
with S. typhimurium (d), L. pneumophila (e) or transfection with flagellin
(f). NLRC4-3×Flag was immunoprecipitated (IP) before western blot. Data are
representative of three independent experiments.
Next we determined whether BMDMs phosphorylated NLRC4
Ser 533 in response to other NLRC4-activating stimuli. L. pneumophila
infection (Fig. 1e) and transfection with purified S. typhimurium
flagellin (Fig. 1f) both triggered NLRC4 Ser 533 phosphorylation that
coincided with caspase-1 processing. In contrast, robust Ser 533
phosphorylation was not detected in LPS-primed BMDMs treated with
ATP, nigericin or dsDNA (Fig. 2a), indicating that NLRC4 Ser 533
phosphorylation is specific to NLRC4-activating stimuli and not a
general consequence of caspase-1 activation. In keeping with the latter
notion, the caspase-1 inhibitor Ac-YVAD-fmk decreased self-processing
of caspase-1 in BMDMs infected with S. typhimurium, but did not
reduce NLRC4 Ser 533 phosphorylation (Fig. 2b). To delineate
whether flagellin is essential for NLRC4 Ser 533 phosphorylation after
S. typhimurium infection, we infected BMDMs with S. typhimurium
ΔflgB, which lacks flagellin, but has an intact type III secretion system
(T3SS). Whereas BMDMs infected with the ΔflgB mutant yielded less
NLRC4 phospho-Ser 533 than cells infected with wild-type bacteria
(SL1344 or 12023), no NLRC4 phospho-Ser 533 was detected when
BMDMs were infected with a ΔprgJ mutant, which lacks essential
components of the SPI1 T3SS, or a ΔsipB mutant, which lacks the
virulence factor known to induce apoptosis and caspase-1 activation
(Fig. 2c). These data indicate that phosphorylation of NLRC4 Ser 533
probably reflects both flagellin and T3SS recognition.

We investigated whether NLRC4 Ser 533 phosphorylation is
required for inflammasome function by reconstituting immortalized
macrophage progenitors from Nlrc4−/− mice with wild-type NLRC4
or NLRC4 mutant S533A. Independent clonal progenitor populations
were established that expressed comparable amounts of NLRC4 after
differentiation into macrophages (Supplementary Fig. 3a). Nlrc4−/−
cells reconstituted with wild-type NLRC4 processed caspase-1
(Fig. 2d), secreted IL-1β (Fig. 2e) and underwent pyroptosis (Fig. 2f)
upon infection with S. typhimurium. In contrast, Nlrc4−/− cells
reconstituted with NLRC4 S533A behaved like Nlrc4−/− cells, neither
cleaving caspase-1 nor secreting IL-1β, and demonstrating improved cell
viability (Fig. 2d–f). Similar results were obtained when the
reconstituted macrophages were infected with L. pneumophila (Fig. 2d).
Importantly, the NLRC4 S533A macrophages resembled wild-type
NLRC4 reconstituted macrophages in their ability to secrete IL-6 after
S. typhimurium infection (Supplementary Fig. 3b), and IL-1β when
treated with lipopolysaccharide (LPS)/ATP or LPS/dsDNA (Fig. 2e),
indicating that they have a specific defect in NLRC4-dependent
caspase-1 activation.

Inflammasome components tend to aggregate and kill cells when
overexpressed, so another assay for inflammasome function is
whether reconstituted inflammasome components can be expressed
stably. The retroviral vector used to reconstitute NLRC4 expression in
Nlrc4−/− immortalized progenitors also expressed green fluorescent
protein (GFP), which allowed us to monitor the frequency of
transduced cells. GFP+ progenitor cells transduced with empty vector,
wild-type NLRC4, NLRC4 S533A or phosphomimetic mutant
NLRC4 S533D accounted for 82–93% of live cells at 3 days after
infection (Supplementary Table 1). GFP+ cells were sorted, grown for a
further 15 days, and then subjected to single-cell sorting. At this later
time, GFP+ cells reconstituted with wild-type NLRC4 represented
<10% of live cells in the culture, whereas 38% of cells in cultures
receiving NLRC4 S533A virus remained GFP+. These results imply
that NLRC4 S533A is less deleterious to macrophage progenitors than
wild-type NLRC4, which is consistent with phosphorylation of Ser 533
Figure 2 | NLRC4 Ser 533 phosphorylation is essential for NLRC4-dependent
caspase-1 processing. a–c, Nlrc4 3×Flag knock-in BMDMs infected with
S. typhimurium SL1344 for 4 h, except where indicated (a, c). b, YVAD-fmk was
added 30 min before infection. d–f, Nlrc4−/− progenitors were immortalized
with ER-Hoxb8 and reconstituted with wild-type or S533A NLRC4. Clonal
populations were differentiated into macrophages for bacterial infection for 4 h,
or treatment with LPS/ATP or LPS/transfected dsDNA for 30 min. e, f, IL-1β
(e) or LDH (f) release is the mean ± s.d. of three independent experiments. P
values determined by two-tailed t-test. All immunoblots show cells plus their
culture medium.
being important for inflammasome function. Further supporting this
notion, cells transduced with phosphomimetic mutant NLRC4 S533D
comprised <1% GFP+ cells after 15 days and we were unsuccessful
at isolating a stable, clonal progenitor line. In sum, our data indicate
that phosphorylation of NLRC4 on Ser 533 is critical for NLRC4
inflammasome activation.

We sought the kinase responsible for phosphorylating NLRC4
Ser 533 in fractionated Nlrc4+/+ BMDM cytosolic extracts (Supplementary
Fig. 4a). Fractions that phosphorylated Ser 533 of NLRC4-3×Flag
in vitro were fractionated further and then analysed by mass
spectrometry (Supplementary Fig. 4b). PKCδ and PAK2 were identified
as candidate NLRC4 kinases (Supplementary Table 2), but only
immunodepletion of PKCδ from Nlrc4+/+ BMDM lysates reduced
phosphorylation of NLRC4-3×Flag Ser 533 in vitro (Fig. 3a, b). In
addition, recombinant PKCδ phosphorylated NLRC4 Ser 533 in vitro
(Fig. 3c). Consistent with, but not proving, NLRC4 S533
phosphorylation by PKCδ, the non-selective PKC inhibitors staurosporine and
K-252a reduced S533 phosphorylation in Nlrc4 3×Flag knock-in BMDMs infected
with S. typhimurium (Fig. 3d). Also implicating PKCδ involvement,
phosphorylation of PKCδ Tyr 311, which is a modification induced by
PMA and certain other stimuli, increased in BMDMs infected with
S. typhimurium (Supplementary Fig. 4c).

Initially we explored whether PKCδ activity is important for
inflammasome activation using rottlerin, which is often used as a
PKC inhibitor but can also have other activities. Rottlerin blocked
NLRC4 S533 phosphorylation, caspase-1 and IL-1β processing, and
pyroptosis in BMDMs infected with S. typhimurium, and this did not
seem to be due to impaired uptake of the bacteria (Supplementary
Figure 3
infection. a–c, In vitro kinase assays using NLRC4-3×Flag from Nlrc4 3×Flag knock-in
BMDMs and Nlrc4+/+ BMDM lysates (a) or recombinant PKCδ (c). Western
blots (b) indicate PKCδ or PAK2 immunodepletion in a. d, NLRC4 Ser 533
phosphorylation (pNLRC4) after Nlrc4 3×Flag knock-in BMDMs were cultured with 5 µM
inhibitor or dimethylsulphoxide (DMSO) vehicle for 30 min before and during
S. typhimurium infection for 3 h. e, f, BMDMs infected with S. typhimurium
(multiplicity of infection 5) (e), or primed with LPS and then treated with ATP
or transfected dsDNA for 30 min (f). Cells plus medium were immunoblotted.
Fig. 5a–c). Experiments with Prkcd−/− macrophages lacking PKCδ,
however, provided more compelling evidence of a role for PKCδ in
activation of the NLRC4 inflammasome. Specifically, Prkcd−/−
BMDMs showed markedly less caspase-1 activation and IL-1β processing
than Prkcd+/+ BMDMs after infection with S. typhimurium
(Fig. 3e). This defect in caspase-1 activation and IL-1β release was
not observed when the Prkcd−/− BMDMs were stimulated with ATP
or dsDNA (Fig. 3f), consistent with a specific requirement for PKCδ in
NLRC4-dependent caspase-1 activation. The caspase-1 and IL-1β
processing that does occur in Prkcd−/− cells infected with S. typhimurium
suggests the existence of a compensatory mechanism of NLRC4
activation, albeit a less efficient one. This residual inflammasome activity
probably explains why S. typhimurium-induced pyroptosis is not
reduced significantly in Prkcd−/− BMDMs (Supplementary Fig. 6c).

We have shown previously by immunofluorescence microscopy
that endogenous apoptosis-associated speck-like protein containing
a caspase recruitment domain (ASC) forms a single focus in primary
BMDMs infected with S. typhimurium. This focus co-localizes with
active caspase-1 and its formation correlates with IL-1β processing and
release. We obtained similar results with macrophages differentiated
from Nlrc4+/+ immortalized macrophage progenitors or Nlrc4−/−
cells reconstituted with wild-type NLRC4 (Fig. 4a, b).
Approximately 30% of infected cells displayed a bright ASC focus that
co-stained for the processed caspase-1 p10 subunit at 3 h after
infection. In contrast, neither Nlrc4−/− macrophages nor Nlrc4−/− cells
reconstituted with NLRC4 S533A formed foci containing ASC
and caspase-1 (Fig. 4a, b). These data are consistent with NLRC4
Ser 533 phosphorylation being essential for procaspase-1 recruitment
to the NLRC4 inflammasome complex. Consistent with PKCδ
contributing to NLRC4 inflammasome activation, fewer Prkcd−/−
BMDMs contained an ASC focus when compared to wild-type
BMDMs (Supplementary Fig. 6a, b), despite equivalent uptake of the
bacteria (Supplementary Fig. 6d).

Next we examined NLRC4 protein interactions by immunoprecipitating
wild-type NLRC4 or NLRC4 S533A from macrophages before
and after S. typhimurium infection (Fig. 4c). Nlrc4 3×Flag knock-in BMDMs were
grown in the presence of the pan-caspase inhibitor zVAD-fmk to limit
cell death, and then a reversible cross-linker was used to stabilize
protein complexes for immunoprecipitation. These conditions were
essential to detect the association of endogenous NLRC4-3×Flag with
endogenous ASC and pro-caspase-1 at 2 and 3 h after infection.
Neither interaction was detected in uninfected BMDMs, consistent
with S. typhimurium inducing NLRC4 inflammasome assembly.
Remarkably, PKCδ also co-precipitated with NLRC4-3×Flag from
infected cells, but not uninfected cells, providing further biochemical
evidence of its participation in the NLRC4 inflammasome. We did not
culture reconstituted Nlrc4−/− macrophages expressing NLRC4
S533A in zVAD-fmk because the cells are resistant to S. typhimurium-induced
pyroptosis (Fig. 2f). Unfortunately, Nlrc4−/− macrophages
reconstituted with wild-type NLRC4 did not tolerate culture in
zVAD-fmk and could not be analysed in parallel. Given that anti-Flag
antibodies detect NLRC4-3×Flag more readily than S533A
NLRC4-1×Flag, we loaded substantially more of the immunoprecipitated
NLRC4 S533A in western blots to favour complex detection. Despite
this bias, we failed to detect any interaction of NLRC4 S533A with
pro-caspase-1, ASC or PKCδ after S. typhimurium infection (Fig. 4c).
These data indicate that phosphorylation of NLRC4 S533 is critical for
inflammasome assembly.

It has been shown that specific NLR family, apoptosis inhibitory
proteins (NAIPs) engage NLRC4 after interacting directly with flagellin
or T3SS components such as PrgJ. Our data indicate that
phosphorylation of NLRC4 Ser 533 by kinases such as PKCδ represents another
important step in the formation of a fully functional inflammasome
(Fig. 4d). It will be interesting to learn whether phosphorylation
regulates the related NLRP3 inflammasome, which is mutated in chronic
inflammatory autoimmune syndrome.
Figure 4 | NLRC4 S533 phosphorylation is required for inflammasome
assembly after S. typhimurium infection. a, b, Immunofluorescence
microscopy of macrophages grown with S. typhimurium and zVAD-fmk for
3 h. Bars, 10 µm (a). The percentage of cells containing an ASC focus ± s.d. was
determined from three independent experiments (b). P values determined by
two-tailed t-test. c, Nlrc4 3×Flag knock-in BMDMs and Nlrc4−/− NLRC4 S533A
reconstituted macrophages were infected with S. typhimurium and NLRC4
immunoprecipitated after cross-linking. d, Model for NLRC4 activation.
NAIPs recognizing specific bacterial ligands and phosphorylation of NLRC4
Ser 533 between the NACHT and LRR motifs together promote NLRC4
inflammasome activation.


METHODS SUMMARY 

Mice. Nlrc4 3×Flag knock-in mice were generated from C57BL/6 ES cells containing a 3×Flag
sequence knocked into exon 9 of Nlrc4 ahead of its translation termination codon
(Supplementary Fig. 1a). Gene Bridges engineered the targeting construct. The
Genentech Institutional Animal Care and Use Committee approved all mouse
protocols.

Mass spectrometry. Nlrc4 3×Flag knock-in BMDMs were infected with S. typhimurium for 4 h.
Immunoprecipitations with anti-Flag agarose were performed on BMDM lysate
and culture medium. Proteins eluted with 3×Flag peptide were resolved by
SDS-PAGE, and bands corresponding to NLRC4 were excised, reduced, alkylated and
digested in situ with trypsin or chymotrypsin as described previously. Extracted
peptides were analysed by ultra high-pressure, reverse-phase chromatography
(nanoAcquity; Waters Corp.) electrospray ionization tandem mass spectrometry
(LTQ-Orbitrap; ThermoFisher Scientific) in a 'top-8' data-dependent
experiment. Tandem mass spectra were searched against a database of mouse proteins
using the Mascot program (Matrix Science Ltd). Hits were filtered to a false
discovery rate of <1% and spectra of putative phosphopeptides validated by
inspection. Phosphopeptide QEpSIQSLR and its unphosphorylated counterpart
were quantified by isotope dilution using stable-isotope-labelled synthetic
peptides (Cell Signaling Technology) spiked into tryptic digests of
NLRC4-3×Flag and analysed by LC-orbitrap mass spectrometry.

Crosslinking. BMDMs were cultured with 30 µM zVAD-fmk for 30 min and then
infected with S. typhimurium in the continued presence of 30 µM zVAD-fmk. A
cleavable imidoester crosslinker DTBP (dimethyl 3,3'-dithiobispropionimidate·2HCl)
was added to cultures 30 min before cell lysis. Cell extracts were obtained by
addition of 100 mM Tris pH 8.0 and 1% Triton X-100. NLRC4 was
immunoprecipitated with anti-Flag agarose overnight.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Antibodies and reagents. Rabbit antibodies from Santa Cruz (sc-213) and Cell
Signaling (2058) were used to immunoprecipitate and immunoblot PKCδ,
respectively. Rabbit antibodies A301-263A and A301-264A (Bethyl) were
combined against PAK2. Other antibodies detected Flag (M2, Sigma), ASC (Genentech
rat clone 8E4.1), NLRC4 (Genentech hamster clone 1D4.8.3), phosphorylated
PKCδ (Cell Signaling 2055), caspase-1 (Santa Cruz sc-514; Genentech rat clone
4B4), actin (Santa Cruz sc-1616), and IL-1 (Biological Resources Branch, NCI).
Rabbits immunized with NLRC4 phosphopeptide NH2-LWRQEpSIQSLR-OH
(NLRC4 phosphorylation site Ser 533 is in bold; ProSci) were used to generate
monoclonal GEN-82 clone 3-3 (Genentech).

Reagents included YVAD-fmk (BV-1012-3) and zVAD-fmk (4800-520) from
MBL, crosslinker DTBP (20665) from Thermo Scientific, recombinant human
PKCδ from R&D Systems (4858-KS), Rottlerin from Enzo Life Sciences
(350-075-M010), and Gö6976 (365250), PKCθ inhibitor (539649), and a kinase inhibitor
library from EMD Chemicals.

Bacterial strains. Bacterial strains included S. typhimurium SL1344 wild-type and
ΔsipB, S. typhimurium 12023 wild-type and mutants ΔprgJ, ΔflgB, ΔprgJΔflgB. L.
pneumophila was from ATCC (33152).

Mice. Nlrc4 3×Flag knock-in mice were generated from C57BL/6 embryonic stem cells
containing a 3×Flag sequence knocked into exon 9 of Nlrc4 ahead of its translation
termination codon (Supplementary Fig. 1a). Gene Bridges engineered the
targeting construct. PCR genotyping primers (5'-AGCAACATTCCATTTCCCGG-3'
and 5'-CACTGTGTGAGCAGTGACGGAT-3') yielded 273-base-pair (bp)
wild-type and 422-bp knock-in DNA fragments (Supplementary Fig. 1b).

The Genentech Institutional Animal Care and Use Committee approved all 
mouse protocols. 

Macrophage cultures and stimulations. Bone marrow cells were grown for 5-7 days
in DMEM containing 20% L cell-conditioned medium, 10% heat-inactivated FBS,
100 U ml⁻¹ penicillin, 10 mM L-glutamine, and 100 mg ml⁻¹ streptomycin.
BMDMs were then replated in 6- or 24-well plates for experiments. BMDMs were
primed with 0.5 µg ml⁻¹ Escherichia coli LPS serotype O111:B4 (List Biological
Laboratories) for 4 h and then stimulated with ATP (5 mM; Sigma),
DOTAP-transfected flagellin (20 µg ml⁻¹; InvivoGen), or Lipofectamine 2000-transfected
pcDNA3.1 (1 µg ml⁻¹; Invitrogen). Multiplicity of infection for bacteria was 20
unless indicated otherwise. Infections were performed as described. A
Cytotoxicity Detection Kit (Roche 04744934001) measured LDH release.
IL-1 secretion was measured by luminex assay.

Immortalized macrophage progenitors. Macrophage progenitors were
immortalized with ER-Hoxb8 ex vivo as described (ref. 23). Nlrc4⁻/⁻ cells were reconstituted with
retroviral vector pMX-GFP (Cell Biolabs) containing mouse NLRC4 (wild-type or
S533A) with a C-terminal Flag tag. Retrovirus particles were generated by calcium
phosphate transfection of Phoenix.eco packaging cells. GFP⁺ cells were sorted
3 days after spin infection in a FACSAria (BD Biosciences). After a further 15 days,
GFP⁺ cells were sorted one cell per well to obtain clonal progenitor lines.
Immortalized macrophage progenitors were grown in RPMI-1640 containing
10% heat-inactivated FBS, 100 U ml⁻¹ penicillin, 10 mM L-glutamine, 100 mg ml⁻¹
streptomycin, 1 mM β-oestradiol (Sigma), and 10 ng ml⁻¹ GM-CSF (R&D
Systems). β-Oestradiol removal for 5-7 days promoted macrophage differentiation.

Mass spectrometry. Nlrc4 3×Flag knock-in BMDMs were infected with S. typhimurium for 4 h
and then NLRC4-3×Flag was immunoprecipitated from BMDM lysate and
culture medium with anti-Flag agarose. Proteins were eluted with 3×Flag peptide,
and bands corresponding to NLRC4-3×Flag after SDS-PAGE were excised,
reduced, alkylated and digested in situ with trypsin or chymotrypsin as
described. Extracted peptides were analysed by ultra high-pressure,
reverse-phase chromatography (nanoAcquity; Waters Corp.) electrospray ionization
tandem mass spectrometry (LTQ-Orbitrap; ThermoFisher Scientific) in a
'top-8' data-dependent experiment. Tandem mass spectra were searched
against a database of mouse proteins using the Mascot program (Matrix Science
Ltd). Hits were filtered to a false discovery rate of <1% and spectra of putative
phosphopeptides validated by inspection. Phosphopeptide QEpSIQSLR and its
unphosphorylated counterpart were quantified by isotope dilution using
stable-isotope-labelled synthetic peptides (Cell Signaling Technology) spiked into tryptic
digests of NLRC4-3×Flag and analysed by LC-orbitrap mass spectrometry.

Crosslinking. BMDMs were grown with 30 µM zVAD-fmk for 30 min and then
infected with S. typhimurium in the continued presence of 30 µM zVAD-fmk. A
cleavable imidoester crosslinker DTBP (dimethyl 3,3'-dithiobispropionimidate·2HCl)
was added to cultures 30 min before cell lysis. Cell extracts were obtained by
addition of 100 mM Tris pH 8.0 and 1% Triton X-100. NLRC4 was
immunoprecipitated with anti-Flag agarose overnight.

In vitro kinase assays. NLRC4-3×Flag was affinity-purified from 5 × 500 cm²
plates of BMDMs in lysis buffer containing 20 mM Tris pH 7.8, 10 mM MgCl2,
90 mM NaCl, 0.1% Triton X-100, EDTA-free protease inhibitor cocktail (Roche),
and Halt phosphatase inhibitor cocktail (Thermo Scientific). Anti-Flag agarose
incubated with soluble cell lysate overnight at 4 °C was washed once with buffer 1
(20 mM HEPES pH 7.9, 1.5 mM MgCl2, 420 mM NaCl, 1.5 M urea, 0.2 mM
EDTA, 25% glycerol, protease and phosphatase inhibitors) and three times with
buffer 2 (20 mM Tris pH 7.5, 300 mM NaCl, 0.2 mM EDTA, 20% glycerol, 0.1%
NP40, protease and phosphatase inhibitors). Beads were eluted with 0.5 mg ml⁻¹
3×Flag peptide (Sigma) in PBS containing 0.1% Tween 20. In vitro kinase assays
combined wild-type BMDM lysate (60 ng total protein) with affinity-purified
NLRC4-3×Flag at 30 °C for 1 h.

NLRC4 kinase identification. Wild-type BMDMs (1 × 500 cm² plate) were
swollen in 1 ml of hypotonic buffer (25 mM Tris pH 7.5, 35 mM NaCl, and
10 mM MgCl2) for 10 min, Dounce-homogenized (100 strokes), and subjected
to ultracentrifugation (100,000g, 30 min) at 4 °C. Reduced Triton X-100
(0.1%; Sigma) was added to the supernatant, and 0.5 ml was fractionated with a
Superdex 200 10/300 GL column (GE). Fractions positive by in vitro kinase
assay were concentrated (using a 30-kDa cut-off) and applied to a MonoQ column
(1 ml, GE). Washes included 2 column volumes of hypotonic buffer + 0.1%
reduced Triton X-100, and then the gradient was increased to 50% high-salt
buffer (600 mM NaCl) for three column volumes. Fractions positive by in vitro
kinase assay underwent further MonoQ chromatography with a gradient of
0-100% high salt, and then Superdex 200 10/300 GL chromatography as before.
Final positive fractions were pooled and resolved by SDS-PAGE for mass
spectrometry (LTQ-orbitrap XL). Bands were excised 'top-to-bottom' from the
entire gel lane.

Confocal microscopy. Macrophages differentiated from immortalized progenitors
were infected in four-well chamber slides with S. typhimurium for 3 h. Cells were
washed with PBS, fixed with 4% paraformaldehyde in PBS for 30 min, and
permeabilized with 0.1% Triton X-100 for 15 min. Blocking buffer (2% BSA in
PBS) was applied for 15 min, and then cells were stained with primary antibodies
(8E4 anti-ASC at 2 µg ml⁻¹; anti-CASP1 sc-514 diluted 1:100; anti-S. typhimurium
polyclonal antibody diluted 1:20,000) for 1 h in blocking buffer. Cells were
washed with PBS three times and incubated with Alexa Fluor-coupled secondary
antibodies (1:250; Invitrogen) for 30 min. Cells were washed four times with PBS,
mounted in ProLong Gold Antifade with DAPI (Invitrogen), and imaged with a
Leica TCS SPE confocal microscope. 400 cells were examined in three independent
experiments.


23. Wang, G.G. et al. Quantitative production of macrophages or neutrophils ex vivo 
using conditional Hoxb8. Nature Methods 3, 287-293 (2006). 
BATF-JUN is critical for IRF4-mediated transcription in T cells


Peng Li, Rosanne Spolski, Wei Liao, Lu Wang, Theresa L. Murphy, Kenneth M. Murphy & Warren J. Leonard


Interferon regulatory factor 4 (IRF4) is an IRF family transcription
factor with critical roles in lymphoid development and in regulating
the immune response. IRF4 binds DNA weakly owing to a
carboxy-terminal auto-inhibitory domain, but cooperative binding
with factors such as PU.1 or SPIB in B cells increases binding affinity,
allowing IRF4 to regulate genes containing ETS-IRF composite
elements (EICEs; 5'-GGAAnnGAAA-3'). Here we show that in
mouse CD4⁺ T cells, where PU.1/SPIB expression is low, and in B
cells, where PU.1 is well expressed, IRF4 unexpectedly can cooperate
with activator protein-1 (AP1) complexes to bind to AP1-IRF4
composite (5'-TGAnTCA/GAAA-3') motifs that we denote as
AP1-IRF composite elements (AICEs). Moreover, BATF-JUN family
protein complexes cooperate with IRF4 in binding to AICEs in
pre-activated CD4⁺ T cells stimulated with IL-21 and in TH17
differentiated cells. Importantly, BATF binding was diminished
in Irf4⁻/⁻ T cells and IRF4 binding was diminished in Batf⁻/⁻ T
cells, consistent with functional cooperation between these factors.
Moreover, we show that AP1 and IRF complexes cooperatively
promote transcription of the Il10 gene, which is expressed in
TH17 cells and potently regulated by IL-21. These findings reveal
that IRF4 can signal via complexes containing ETS or AP1 motifs
depending on the cellular context, thus indicating new approaches
for modulating IRF4-dependent transcription.

There are nine mammalian interferon regulatory factor (IRF) family
members, IRF1 to IRF9, that collectively have broad actions within and
beyond the immune system. IRFs were identified on the basis of their
induction by type I interferons (IFN-α/β), and some IRFs are induced
by Toll-like receptors and other pattern-recognition receptors. IRF4
expression is restricted to the immune system and is induced in T cells
by T-cell receptor stimulation. IRFs contain an amino-terminal
DNA-binding domain that recognizes 5'-GAAAnnGAAA-3' motifs, but
IRF4 only weakly binds DNA due to its carboxy-terminal
auto-inhibitory domain. In B cells, PU.1 or the related factor SPIB relieves
auto-inhibition to increase binding affinity, allowing IRF4 to regulate
genes containing composite 5'-GGAAnnGAAA-3' ETS-IRF consensus
motif elements (EICEs), including κ and λ immunoglobulin light
chain genes. Whereas PU.1 binds directly to EICEs, efficient IRF4
binding requires phosphorylated, DNA-bound PU.1. IRF4 also acts in T
cells, contributing to development of multiple TH cell subsets, with
defective TH1, TH2, TH9 and TH17 differentiation in its absence.
Using genome-wide chromatin immunoprecipitation coupled to DNA
sequencing (ChIP-Seq), we previously demonstrated that IRF4
cooperates with STAT3 to control IL-21-induced Prdm1 expression and that
these factors globally regulate IL-21-mediated gene expression.
Moreover, we found that IRF4 expression is required for normal
STAT3 binding in vivo and for development of an additional T-cell
effector population, namely Tfh cells.

As anticipated, analysis of IRF4 ChIP-Seq peaks from B cell libraries
we previously generated identified EICEs as the top motif (Fig. 1a, b).
In contrast, EICEs were not readily identified in IRF4 ChIP-Seq
libraries from activated T cells (Fig. 1c, d) or TH17 cells (Fig. 1e),
consistent with T cells expressing much less PU.1 than B cells
(Fig. 1f and Supplementary Fig. 1). Instead, examination of the top
1,000 peaks (sorted by P values) from libraries from pre-activated
T cells, unstimulated or stimulated with IL-21, or from TH17 cells,
unexpectedly revealed that the top IRF4 ChIP-Seq motifs were
activator protein-1 (AP1) 5'-TGA[G/C]TCA-3' motifs (Fig. 1c-e).
IRF8, which like IRF4 also interacts with PU.1, bound to some
AP1-containing sites but dominantly bound to canonical IRF motifs
with tandem GAAA motifs (5'-GAAA[C/G][T/A]GAAA[G/C]-3')
(Supplementary Fig. 2a, b). To elucidate the IRF4-AP1 interaction,
we examined whether IRF4 GAAA/TTTC core motifs were associated
with AP1 sites, and within IRF4 ChIP-Seq peaks, we found enrichment
of these motifs adjacent to or five base pairs away (four intervening
base pairs) from AP1 sites (Fig. 1g and Supplementary Fig. 3), in
contrast to their overall random distribution (Fig. 1g, blue line),
indicating binding cooperativity for AP1 and IRF4. We denote these
AP1-IRF4 composite elements as AICEs.

Figure 1 | Preferential IRF4 binding to AP1 motifs in T cells. a-e, IRF4
motifs in B cells not stimulated (a) or stimulated (b) with IL-21, in pre-activated T
cells not stimulated (c) or stimulated (d) with IL-21 for 1 h, or in TH17 cells
(e). f, Sfpi1/PU.1 mRNA expression in the indicated populations based on
RNA-Seq. g, 5'-TTTC/GAAA-3' motif distribution relative to the 5'-TGAnTCA-3'
AP1 motif in 1,000 sequences with strong IRF4 binding, and the number of IRF4
ChIP-Seq peaks at each spacing. The blue line shows the 5'-TTTC/GAAA-3'
distribution in 1,000 random sequences; motif frequency was approximately 0.5%.

Of 14,838 IRF4 ChIP-Seq peaks, 5,304 bound within genes
annotated by RefSeq (Fig. 2a), and analysis of our published
Affymetrix array data sets revealed that 2,356 of these genes were
regulated by IL-21 at 1, 6 or 24 h (Fig. 2b). RNA-Seq analysis revealed
markedly lower expression of some of these genes, including Prdm1
and Il10, in IL-21-stimulated Irf4⁻/⁻ than in wild-type T cells (Fig. 2c),
underscoring the importance of IRF4 for their expression. To
characterize the IRF4-binding complex in T cells, we analysed
ChIP-Seq libraries from TH17-differentiated cells and IL-21-stimulated
pre-activated T cells, focusing on JUN family proteins and BATF, which
can heterodimerize with JUN proteins to bind to AP1 motifs and is
critical for TH17 differentiation, a process promoted by IL-21.
In TH17 cells, approximately 54% (11,693 out of 21,775) of the IRF4
binding sites overlapped with BATF binding sites, and approximately
65% of the BATF sites overlapped with IRF4, indicating substantial
co-localization of these factors (Fig. 2d). As expected, the dominant
binding motif for BATF was an AP1 motif (5'-TGA[G/C]TCA-3')
(Supplementary Fig. 4), and IRF4, BATF and JUN family proteins
co-localized by ChIP-Seq (Fig. 2d and Supplementary Fig. 5a, b).
The specificity of these data was indicated by essentially absent
ChIP-Seq peaks in the IgG control as well as with anti-IRF4 in
Irf4⁻/⁻ cells or anti-BATF in Batf⁻/⁻ cells (Fig. 2e). ChIP-Seq peaks
for STAT3 also globally co-localized with IRF4 (Supplementary Fig. 5c),
including at the Prdm1 region previously studied and in the Il21
promoter and Il17a 3' region (Fig. 2e). Interestingly, approximately
50% of genes (1,167 out of 2,356) with co-localization of these
transcription factors in TH17 cells were also induced by IL-21 in activated
CD4⁺ T cells. STAT3 binds to GAS motifs rather than AP1 motifs, but
its co-localization at AP1 motifs might be explained by the ability of
STAT3 to physically associate with JUN.

Figure 2 | Functionally important IRF4 binding to AP1 sites. a, Distribution
of IRF4 ChIP-Seq peaks. UTR, untranslated region. b, Venn diagram of genes
bound by IRF4 and regulated by IL-21. c, Heat map of 130 genes regulated in
pre-activated CD4⁺ T cells after 1 h stimulation with IL-21. WT, wild type.
d, IRF4, BATF, and JUND binding in TH17 cells. e, f, ChIP-Seq data for IRF4,
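The co-localization percentages above are fractions of one factor's peaks that overlap the other's. A minimal sketch of that calculation follows, assuming peaks are half-open (chromosome, start, end) intervals and that any shared base counts as overlap; the text does not state the overlap criterion used, and the function name and toy intervals are hypothetical.

```python
def fraction_overlapping(peaks_a, peaks_b):
    """Fraction of intervals in peaks_a that overlap any interval in peaks_b.

    Intervals are (chrom, start, end) tuples, half-open. The "any shared
    base" criterion is an assumption, not the published procedure.
    """
    by_chrom = {}
    for chrom, start, end in peaks_b:
        by_chrom.setdefault(chrom, []).append((start, end))
    hits = 0
    for chrom, start, end in peaks_a:
        others = by_chrom.get(chrom, [])
        if any(start < b_end and b_start < end for b_start, b_end in others):
            hits += 1
    return hits / len(peaks_a) if peaks_a else 0.0


# Toy intervals, invented for illustration only.
irf4_peaks = [("chr1", 100, 300), ("chr1", 1000, 1200), ("chr2", 50, 250)]
batf_peaks = [("chr1", 250, 450), ("chr2", 500, 700)]
toy_fraction = fraction_overlapping(irf4_peaks, batf_peaks)

# Counts quoted in the text: 11,693 of 21,775 IRF4 sites overlap BATF sites.
reported_fraction = 11_693 / 21_775
```

The quadratic scan is fine for a toy example; for genome-scale peak sets a sorted-interval sweep or a dedicated interval library would be the usual choice.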

To investigate potential cooperative binding between IRF4 and AP1
complexes, we identified strong IRF4 binding sites containing a
5'-GAAA-3' motif adjacent to or 5 bp away from the AP1 motif (a
preferred spacing in Fig. 1g; Supplementary Table 1 lists genes with
these sites). We selected sites in the Il10, Ikzf2 (which encodes Helios)
and Ctla4 genes and confirmed co-localization of IRF4, STAT3, BATF
and JUN by ChIP-Seq (Fig. 2f). The Il10 gene, which is expressed in
IL-21-stimulated CD4⁺ T cells, CD8⁺ T cells and polarized TH17
cells, contained two IRF4 binding sites with associated AP1 motifs
(Fig. 2f). We performed electrophoretic mobility shift assays (EMSAs)
with TH17 nuclear extracts and a probe corresponding to the conserved
noncoding sequence, CNS9, located approximately 9.1 kilobases 5' of
the Il10 transcription start site (Fig. 3a, Il10 peak 1), which is known to be
an Il10 regulatory element. A strong complex formed, but it was
reduced when the 5'-GAAA-3' motif was mutated and abolished when
the AP1 motif was mutated (Fig. 3b). Supershifting with antibodies
revealed that IRF4, JUNB, JUND and BATF were components of the
complex (Fig. 3c); these factors also bound to Il10 peak 2 and Ikzf2
probes (Fig. 3c). As expected, no shift was seen when nuclear extracts
were omitted (Supplementary Fig. 6). Antibodies to FOS and FRA2 had
a minor effect on the Il10 peak 1 and no effect on the Il10 peak 2 and Ikzf2
complexes (Fig. 3c). In contrast, an AP1 consensus probe complex was
not supershifted by anti-IRF4 but was supershifted by antibodies against
BATF, FOS and FRA2 (Fig. 3c). We next studied binding to the Il10 peak
1 IRF4 motif in B cells, TH2 cells and TH9 cells, which all express Il10.
B-cell nuclear extracts formed a complex supershifted by antibodies
against IRF4, BATF and JUNB, but not PU.1 (Supplementary Fig. 7a),
even though anti-PU.1 antibody supershifted a complex formed with
an EICE probe from the immunoglobulin λ light chain enhancer
(Supplementary Fig. 7b). Thus, EICEs were the most common
IRF4-containing complexes in B cells (Fig. 1a, b), but IRF4/AP1 AICEs also
formed in these cells (Fig. 1b). Although TH2 and TH9 polarized cells
were reported to express PU.1 protein, RNA-Seq analysis showed
little PU.1 messenger RNA in these cells (Fig. 1f), and EMSAs showed
IRF4-BATF-JUNB interactions but no PU.1-binding activity
(Supplementary Fig. 7c). To determine whether IRF4 and BATF-JUN
proteins cooperatively bound to DNA, we used nuclear extracts from
293T cells transfected with various combinations of IRF4, BATF and
JUNB or JUND and performed EMSAs with Il10, Ctla4 and Ikzf2 probes.
Little if any binding activity was observed with extracts from 293T cells
expressing IRF4, JUND or BATF alone; certain pairwise combinations
showed some binding, but strong binding was seen with extracts
containing all three proteins, indicating cooperative binding to these sites
(Fig. 3d, left); this was also observed when JUNB was substituted for
JUND (Fig. 3d, right). Cooperative binding was indicated by slower
mobility, particularly of the Il10 peak 1 probe (Fig. 3d). Although mobility
changes for other probes were less evident, even on 4% or 7% gels (not
shown), supershifting experiments confirmed that IRF4, JUND and
BATF were present in complexes formed with each probe (Fig. 3e).

Figure 3 | Cooperative IRF4/BATF/JUN binding. a, Probes for IRF4
ChIP-Seq peaks in Il10, Ikzf2, and Ctla4, and an AP1 probe; AP1-IRF motifs are
underlined. The AP1 probe is from ref. 15. b, EMSA with Il10 peak 1 probe (WT
or IRF, AP1, or IRF4/AP1 double mutants) and TH17 nuclear extracts. c, EMSAs
using Il10 peak 1, Il10 peak 2, Ikzf2, or AP1 probes and TH17 nuclear extracts;
supershifting was performed as indicated. Ctrl Ig, control immunoglobulin.
d, EMSAs using nuclear extracts from 293T cells transfected with indicated
cDNAs; TH17 extract vs. no extract are also shown (lane 10 vs. lane 9). e, EMSAs
with nuclear extracts from 293T cells transfected with IRF4, JUND, and BATF;
supershifting was with anti-IRF4, anti-JUND, or anti-BATF. EMSAs were
performed at least 3 times.

To examine the functional significance of the Il10 IRF4 motif, we
first analysed Il10 mRNA expression in Irf4⁻/⁻ T cells and found much
lower Il10 mRNA in response to IL-21, anti-CD3 + anti-CD28, or
IL-21 + anti-CD3 + anti-CD28 than was observed in wild-type cells
(P < 0.02 at both time points; Fig. 4a). Correspondingly, Il10 luciferase
reporter activity was potently induced by IL-21 or anti-CD3 +
anti-CD28, and more so by IL-21 + anti-CD3 + anti-CD28, but expression
was diminished when the GAAA IRF4 motif or associated AP1 site was
mutated (P < 0.02; Fig. 4b). Moreover, IRF4, JUNB, JUND and BATF
each bound to a wild-type probe spanning this region, but binding was
diminished when Irf4⁻/⁻ nuclear extracts were used (Fig. 4c).
Moreover, in ChIP-Seq experiments, there was markedly decreased
binding of IRF4 in Batf⁻/⁻ T cells and of BATF and JUN in Irf4⁻/⁻
T cells, not only at the Il10 locus (Fig. 4d) but also globally (Fig. 4e), including for
example at the Il17a gene (Fig. 4f), consistent with defective Il17a
expression and TH17 differentiation in Irf4⁻/⁻ and Batf⁻/⁻ T cells.
These results indicate cooperative binding and transcriptional
activation by IRF4 and BATF-JUN family proteins.

Figure 4 | Cooperative action of
IRF4, BATF, and JUN. a, Il10
mRNA expression relative to Rpl7 in
WT or Irf4⁻/⁻ T cells pre-activated
and treated as indicated for 4 or 24 h
(n = 3; mean ± s.d.). b, WT or
mutant Il10 reporter constructs

IRF4 is a pleiotropic IRF family transcription factor with broad
immunological actions. Its critical role in regulating immunoglobulin
genes involves functional cooperation with the largely B-cell-restricted
factor PU.1. We now demonstrate that in T cells, where PU.1 expression
is low, IRF4 instead functionally cooperates with AP1 family proteins to
act via AICEs, with functional cooperation with BATF and JUN family
proteins in pre-activated T cells stimulated with IL-21 as well as in TH2,
TH9 and TH17 polarized cells. Interestingly, a number of genes we
selected for analysis (Il10, Ctla4, Il17a, Prdm1 and Ikzf2) were
functionally grouped in a study of TH2 inhibitory effector cells during chronic
inflammation as preferentially expressed in IL-10⁺ versus IL-10⁻ cells;
it will be interesting to determine whether IRF4/AP1-dependent gene
expression helps to explain these observations. Although IRF4 and BATF
cooperatively bound in the context of AICEs, it was unclear if they
associated in the absence of these sites. In T cells, we could co-precipitate
BATF and JUN (Supplementary Fig. 8), but we only co-precipitated IRF4
and JUN in a single experiment and could not co-precipitate BATF and
IRF4. Thus, if a direct interaction occurs, it may be relatively weak, but the
marked decrease of BATF binding in Irf4⁻/⁻ and of IRF4 in Batf⁻/⁻ cells
(Fig. 4) indicates cooperative binding to AICEs. This binding is
dependent on the BATF leucine zipper domain. The ability of IRF4 to act via
two types of complexes, PU.1-IRF4 EICEs in B cells and AP1-IRF4
AICEs in T cells and to some degree in B cells, highlights mechanisms
for IRF4-mediated transcriptional activation. The identification of the
IRF4-AP1 connection suggests new approaches may be employed to
selectively target certain actions of IRF4, potentially allowing ways to
manipulate the immune response in a cell-type-restricted fashion.


METHODS SUMMARY 

T and B cells were cultured in standard medium. Cells were pre-activated with
anti-CD3 + anti-CD28 for 3 days, rested overnight and stimulated with IL-21 for
1 h (ChIP-Seq) or 4 h (EMSAs). TH17 polarization was performed by standard
methods and, unlike CD4⁺ T cells, TH17 polarized cells were not stimulated with
IL-21. WT, Batf⁻/⁻ and Irf4⁻/⁻ mice were 6-8-week-old C57BL/6 background
mice of mixed gender. All experiments with mice were performed under protocols
approved by the NHLBI Animal Care and Use Committee, and followed NIH
guidelines for use of animals in intramural research. ChIP-Seq and RNA-Seq
experiments were performed as previously described. ChIP-Seq and RNA-Seq
libraries are summarized in Supplementary Table 2; details of binding site
identification and motif analysis are in the online methods. For reporter assays,
activated T cells were electroporated with reporter plasmid and pRLTK, and dual
luciferase assays performed. For EMSAs, nuclear extracts were bound to
³²P-labelled probes. For supershifts, extracts were pre-incubated with indicated
antibodies, before analysis on 5% polyacrylamide gels.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS


Cells and cell culture. Mouse T and B cells were isolated using kits (Miltenyi) and
cultured in RPMI-1640 medium containing 10% fetal bovine serum. Cells were
pre-activated with plate-bound anti-CD3 (2 µg ml⁻¹) + soluble anti-CD28
(1 µg ml⁻¹) for 3 days, rested overnight and stimulated with IL-21 (100 ng ml⁻¹)
for 1 h (ChIP-Seq) or 4 h (EMSAs). For TH17 polarization, cells were subjected to 2
rounds of polarization with anti-CD3 + anti-CD28 for 4 days in the presence of
IL-6 (10 ng ml⁻¹), TGF-β (2 ng ml⁻¹), anti-IFN-γ (10 µg ml⁻¹), and anti-IL-4
(10 µg ml⁻¹). Unlike the CD4⁺ T cells, TH17 polarized cells were not stimulated
with IL-21.

Mice. Wild-type, Batf⁻/⁻ and Irf4⁻/⁻ mice were 6-8-week-old C57BL/6
background mice of mixed gender. All experiments with mice were performed under
protocols approved by the NHLBI Animal Care and Use Committee, and followed
NIH guidelines for use of animals in intramural research.

ChIP-Seq experiments. We used chromatin from approximately 2 × 10⁷ cells,
which corresponds to approximately 100 ng of DNA, for each ChIP-Seq library
and antibodies to IRF4 (Santa Cruz, sc-6059), STAT3 (Invitrogen), BATF, JUN
(Abcam, ab31419), JUNB (Santa Cruz, sc-73), and JUND (Santa Cruz, sc-74). The
ChIPed DNA fragments were blunt-ended, ligated to adaptors, and sequenced
using an Illumina 1/2G Genome Analyzer and HiSeq2000 platform to obtain reads
of 25-50 bp, depending on the platform. Sequenced reads were aligned to the
mouse genome (NCBI36/mm8, February 2006 assembly) with Bowtie 0.12.4;
only uniquely mapped reads were retained. The numbers of uniquely mapped and
non-redundant reads for each library are listed in Supplementary Table 2.
The output of Bowtie was converted to BED files, which represent the genomic
coordinates of each read. Reads were mapped into non-overlapping 200-bp windows,
and the location of reads on the positive (negative) strand was shifted ±75 bp from its
5' start to determine the approximate centre of the DNA fragment associated with
the reads. With these locations, the reads in each 200-bp summary window were
counted. BedGraph files were generated and viewed using the UCSC genome
browser, and we aligned the BATF, IRF4, JUN and STAT3 binding sites in
IL-21-stimulated CD4⁺ T cells or TH17 cells. Some experiments were also performed in
cells from Batf⁻/⁻ or Irf4⁻/⁻ mice as well as from WT mice. Because each antibody
presumably has a different binding affinity, we scaled libraries that used the same
antibodies to normalize binding strength, but libraries from different antibodies
were not scaled.
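The read-shifting and window-counting step described above can be sketched as follows. This is a minimal illustration, not the authors' pipeline: it assumes BED-style 0-based, end-exclusive coordinates, takes the 5' end of a minus-strand read to be its last base, and uses hypothetical function names and toy reads.

```python
from collections import Counter

WINDOW = 200  # bp, summary window size stated in the text
SHIFT = 75    # bp, shift from the read's 5' start stated in the text


def fragment_centre(start, end, strand):
    """Approximate fragment centre for one BED-style read."""
    if strand == "+":
        return start + SHIFT        # plus strand: 5' end is `start`
    return (end - 1) - SHIFT        # minus strand: 5' end is the last base


def window_counts(reads):
    """Count shifted reads per non-overlapping 200-bp window.

    `reads` is an iterable of (chrom, start, end, strand) tuples.
    Returns a Counter keyed by (chrom, window_start).
    """
    counts = Counter()
    for chrom, start, end, strand in reads:
        centre = max(fragment_centre(start, end, strand), 0)
        counts[(chrom, (centre // WINDOW) * WINDOW)] += 1
    return counts


# Toy reads, invented for illustration only.
reads = [
    ("chr1", 100, 136, "+"),  # centre 175, window starting at 0
    ("chr1", 290, 326, "-"),  # centre 250, window starting at 200
    ("chr1", 180, 216, "+"),  # centre 255, window starting at 200
]
counts = window_counts(reads)
```

Shifting each read toward the fragment centre means plus- and minus-strand reads from the same binding event tend to fall in the same window rather than flanking it.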

Database of genes. RefSeq gene database (mm8 revision) was downloaded from 
the UCSC genome browser; 24,769 genes were used for RNA-Seq analysis and 
genome-wide binding site distribution analysis. 

Identification of binding sites. ChIP-Seq experiments were performed to
identify transcription factor binding sites in splenic B cells, CD4⁺ T cells and
TH17 cells. We used MACS 1.3.7.1 to call binding sites (peaks) relative to a
control IgG library as input control. The P-value threshold was set as 1 X 10-"”.
To call a peak, the total number of reads in each peak region needed to be >20 with
FDR <0.1. Only non-redundant reads were analysed for peak calling.

De novo motif discovery. Owing to the computational complexity, for each
library we selected the top 1,000 peaks with lowest P values, extracted 100 bp of
DNA sequence centred on the 'summit' for each peak, and performed de novo
motif analysis using MEME to characterize the IRF4/IRF8 consensus binding
motifs in B cells, T cells, as well as TH17 cells. Motif discovery was also performed
for other transcription factors, including BATF, STAT3, JUN, JUNB and JUND.
Where indicated, the five most significant motifs are shown; motifs were sorted by
consensus E-values or by motif occurrence frequencies.

Motif scanning analysis. For the motif scanning analysis related to Fig. 1g, AP1 
motifs from Fig. 1d were centred and 100 bp of DNA sequence 5’ and 3’ were 
analysed for the proximal IRF motif, 5’-GAAA/TTTC-3’. Matched motif hits were 
counted at each nucleotide position and then plotted using a histogram, with 
breaks set at 200. 
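The scanning step can be sketched as below: for each sequence with a known AP1 motif position, record where GAAA or TTTC occurs relative to that motif, then tally the offsets. This is an illustration under stated assumptions: offsets are measured between the start coordinates of the two motifs (the text does not define the spacing convention), the AP1 motif is taken as 7 bp, and the function name and toy sequence are hypothetical.

```python
from collections import Counter

IRF_CORES = ("GAAA", "TTTC")  # IRF core motif and its reverse complement


def irf_offsets(seq, ap1_start, flank=100, ap1_len=7):
    """Offsets of GAAA/TTTC hits relative to the start of the AP1 motif.

    Scans `flank` bp on each side of the AP1 motif. Negative offsets are
    5' of the AP1 start. The start-to-start convention is an assumption.
    """
    seq = seq.upper()
    lo = max(ap1_start - flank, 0)
    hi = min(ap1_start + ap1_len + flank, len(seq))
    offsets = []
    for i in range(lo, hi - 3):
        if seq[i:i + 4] in IRF_CORES:
            offsets.append(i - ap1_start)
    return offsets


# Toy sequence: GAAA at index 2, AP1 motif TGACTCA at index 6, TTTC at index 15.
toy_seq = "CCGAAATGACTCACCTTTCGG"
toy_offsets = irf_offsets(toy_seq, ap1_start=6)

# Tally offsets across sequences to obtain the per-position histogram.
histogram = Counter(toy_offsets)
```

Summing such tallies over the top peaks, and over matched random sequences as a background, gives the kind of spacing profile compared in Fig. 1g.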

Genome-wide distribution analysis. The 5’ UTR, 3’ UTR, introns, exons and 
intergenic regions were defined according to the RefSeq database. Promoter 
regions were defined as regions extending 15 kb 5’ of the transcription start site. 
Peaks up to 5 kb 3’ of the transcription end site were considered as binding within 
the gene body. 
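The promoter and gene-body definitions above translate into a simple strand-aware rule. The sketch below is illustrative only: it classifies a single peak summit against a single gene, does not subdivide the gene body into UTRs, exons and introns, and assumes 0-based coordinates with tx_start < tx_end; the function name is hypothetical.

```python
PROMOTER_BP = 15_000    # promoter: up to 15 kb 5' of the transcription start site
DOWNSTREAM_BP = 5_000   # peaks up to 5 kb 3' of the transcription end site count as gene body


def classify_peak(summit, tx_start, tx_end, strand):
    """Classify a peak summit relative to one gene.

    Returns "promoter", "gene body" or "intergenic".
    """
    if strand == "+":
        rel_tss = summit - tx_start        # negative means 5' of the TSS
        rel_tes = summit - (tx_end - 1)    # positive means 3' of the TES
    else:
        rel_tss = (tx_end - 1) - summit
        rel_tes = tx_start - summit
    if -PROMOTER_BP <= rel_tss < 0:
        return "promoter"
    if rel_tss >= 0 and rel_tes <= DOWNSTREAM_BP:
        return "gene body"
    return "intergenic"


# Toy gene spanning 20,000-30,000, invented for illustration only.
plus_examples = [classify_peak(s, 20_000, 30_000, "+")
                 for s in (10_000, 25_000, 33_000, 40_000)]
minus_examples = [classify_peak(s, 20_000, 30_000, "-")
                  for s in (35_000, 18_000)]
```

A genome-wide version would test each summit against all annotated genes and needs a tie-breaking rule where definitions overlap, which the text does not specify.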

Reporter assays. CD4+ T cells were activated for 24 h with anti-CD3 + anti-CD28, 
washed, rested overnight, and 10^7 cells were electroporated with 20 µg reporter 
plasmid and 1 µg pRLTK in 0.2 ml RPMI using 960 µF and 250 V. Cells were 
immediately stimulated with IL-21, anti-CD3/anti-CD28, or IL-21 + anti-CD3/ 
anti-CD28. Dual luciferase assays were performed 7 h later (Promega). Shown is 
luciferase activity relative to the control pRLTK activity. 
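
The normalization stated above amounts to dividing the reporter signal by the co-transfected control signal in each sample. The sketch below assumes the usual dual-luciferase layout, with a firefly reporter and a Renilla control from pRLTK; the function names and the example readings are hypothetical.

```python
def relative_luciferase(firefly, renilla):
    """Reporter (firefly) signal divided by the pRLTK (Renilla) control signal."""
    if renilla <= 0:
        raise ValueError("control signal must be positive")
    return firefly / renilla


def mean_relative_activity(readings):
    """Average the per-sample ratios for a list of (firefly, renilla) pairs."""
    ratios = [relative_luciferase(f, r) for f, r in readings]
    return sum(ratios) / len(ratios)


# Made-up readings for two replicate samples, for illustration only.
replicates = [(100.0, 10.0), (300.0, 10.0)]
activity = mean_relative_activity(replicates)
```

Taking the ratio per sample before averaging keeps differences in electroporation efficiency from distorting the comparison between conditions.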

Electrophoretic mobility shift assays. Nuclear extracts were prepared as 
described. Binding reactions contained 5 µg extract, 1.5 µg poly-deoxyinosinic-deoxycytidylic 
acid (poly dI:dC), and 30,000 c.p.m. of 32P-labelled probe. For 
supershift analysis, extracts were pre-incubated for 20 min on ice with antibodies 
to IRF4 (M-17), JUNB (N-17), JUND (329), BATF (WW8), FOS (4), FRA2 
(H103), PU.1 (T-21) (Santa Cruz Biotechnologies). Reactions were electrophoresed 
on 5% polyacrylamide gels in 0.5× TBE buffer. 
Mitochondrial defect drives non-autonomous tumour 
progression through Hippo signalling in Drosophila 


Shizue Ohsawa, Yoshitaka Sato, Masato Enomoto, Mai Nakamura, Aya Betsumiya & Tatsushi Igaki 


Mitochondrial respiratory function is frequently impaired in human 
cancers. However, the mechanisms by which mitochondrial 
dysfunction contributes to tumour progression remain elusive. 
Here we show in Drosophila imaginal epithelium that defects in 
mitochondrial function potently induce tumour progression of sur- 
rounding tissue in conjunction with oncogenic Ras. Our data show 
that Ras activation and mitochondrial dysfunction cooperatively 
stimulate production of reactive oxygen species, which causes 
activation of c-Jun amino (N)-terminal kinase (JNK) signalling. 
JNK cooperates with oncogenic Ras to inactivate the Hippo path- 
way, leading to upregulation of its targets Unpaired (an interleukin-6 
homologue) and Wingless (a Wnt homologue). Mitochondrial 
dysfunction in Ras-activated cells further cooperates with Ras sig- 
nalling in neighbouring cells with normal mitochondrial function, 
causing benign tumours to exhibit metastatic behaviour. Our find- 
ings provide a mechanistic basis for interclonal tumour progression 
driven by mitochondrial dysfunction and oncogenic Ras. 

Cell-cell interactions between oncogenic cells and surrounding 
normal cells in the tumour microenvironment play crucial roles in 
cancer progression. To study the genetic basis of tumour progression 
through cell-cell communications, we performed a genetic screen in 
Drosophila to identify mutations that cause oncogenic RasV12-expressing 
cells to induce non-autonomous growth of surrounding tissue. Clones of 
cells expressing RasV12 overgrow and develop into benign tumours in the 
eye imaginal epithelium (Fig. 1e). Using the MARCM technique, we 
introduced further homozygous mutations in GFP-labelled RasV12-expressing 
clones and screened for those that triggered non-autonomous 
growth (nag) (Supplementary Fig. 1a). We isolated nag mutations that 
resulted in enlarged larval eye discs (Fig. 1f and Supplementary Fig. 2a) 
and subsequent overgrown ‘folded-eyes’ in the adult (Fig. 1b). These nag 
mutations did not promote autonomous growth (Fig. 1a-f and 
Supplementary Fig. 2b), indicating that the mutant clones stimulated 
growth in the surrounding tissue (Supplementary Fig. 2a, c). Indeed, 
increased cell proliferation was evident in wild-type tissue adjacent to 
RasV12/nag−/− clones (Fig. 1g). The non-autonomous overgrowth was 
not caused by nag mutations alone (Fig. 1d, e and Supplementary Figs 1j, 
k and 20). 

Analysis of nag mutations identified lesions in a series of genes 
required for mitochondrial respiratory function: either genes encoding 
components of the mitochondrial respiratory complexes or mitochondrial 
ribosomal proteins (Supplementary Fig. 1l, m). Mutations in different 
components of the mitochondrial respiratory system, including 
Pdsw (nag-19, a component of complex I), mRpL4 (nag-24, a mitochondrial 
ribosomal protein) and CoVa (a component of complex IV), 
cooperated with RasV12 to induce indistinguishable non-autonomous 
overgrowth (Fig. 1b and Supplementary Figs 1b-i and 2). These results 
show that impairments of the mitochondrial respiratory chain in 
Ras-activated cells trigger overgrowth of surrounding tissue. 

It has been reported that dying cells stimulate ‘compensatory 
proliferation’ of their surrounding cells. However, we found that neither 
mitochondrial mutations themselves nor in combination with RasV12 
caused cell death (Supplementary Fig. 3c-f). Furthermore, blocking 
caspase activity by baculoviral p35, which enhances compensatory 
proliferation, did not cause mitochondrial mutant clones to induce 
non-autonomous overgrowth (Supplementary Fig. 3a, b). These results 
indicate that the non-autonomous overgrowth is not caused by com- 
pensatory proliferation but by specific cooperation between Ras activa- 
tion and mitochondrial dysfunction. 

To dissect the mechanism underlying the non-autonomous overgrowth, 
we conducted a secondary genetic screen to identify dominant 
modifiers of the nag phenotype. Using a series of chromosomal deficiencies, 
we introduced heterozygous genomic deletions in flies bearing 
RasV12/CG9140−/− clones (CG9140 encodes a component of 
complex I). We found that one of the strong dominant suppressors of 
nag, Df(3R)H-B79, uncovered Stat92E, a component of the JAK/STAT 
pathway (Supplementary Fig. 4a, b, o). The Drosophila JAK/STAT 
pathway is activated by binding of its ligand Unpaired (Upd) to its 
receptor Domeless (Dome), thereby activating the transcription factor 
Stat92E, which stimulates cell proliferation. Heterozygosity for either 
Stat92E or dome in animals bearing RasV12-expressing clones with 
mitochondrial mutations (RasV12/mito−/−) notably suppressed 
non-autonomous overgrowth (Fig. 1h, i and Supplementary Fig. 4c-h, l, m, 
p-u). In addition, reducing upd expression suppressed the nag phenotype 
(Fig. 1j and Supplementary Fig. 4i, j, n, v, w). Furthermore, 
although neither Ras activation nor mitochondrial dysfunction 
induced Upd expression, combination of these two alterations notably 
upregulated Upd (Fig. 1l-n and Supplementary Figs 5a-c, e, f and 
6a-c). These data indicate that RasV12/mito−/− clones cause 
non-autonomous overgrowth through the Upd-JAK/STAT pathway. 

It has been reported that upd expression is induced by either JNK 
signalling or Notch signalling. We found that blocking JNK in 
RasV12/mito−/− clones by a dominant-negative form of the Drosophila 
JNK Bsk (BskDN) abolished Upd induction (Fig. 1o and Supplementary 
Figs 5d-f and 6d) and notably suppressed non-autonomous overgrowth 
(Fig. 1k and Supplementary Fig. 7). In addition, although Ras activation 
or mitochondrial dysfunction caused only low-level expression of the 
puc-lacZ JNK activity reporter, combination of Ras activation and 
mitochondrial dysfunction cooperatively activated JNK signalling 
(Fig. 1p and Supplementary Fig. 8). On the other hand, reducing 
Notch activity did not affect the overgrowth (data not shown). These 
data show that RasV12/mito−/− clones cooperatively activate JNK 
signalling, which causes Upd induction. 

We next examined the mechanism of JNK activation in RasV12/ 
mito−/− clones. Mitochondrial malfunction is often associated with 
reactive oxygen species production, which activates intracellular stress 
signalling such as JNK signalling. Interestingly, although neither Ras 
activation nor mitochondrial dysfunction (Pdsw−/− or CoVa−/−) 
caused oxidative stress, the combination of these two alterations 
strongly induced oxidative stress (Fig. 2a-c and Supplementary Fig. 
9a, b), as visualized by the gstD-green fluorescent protein (GFP) 
reporter. In addition, staining with dihydroethidium (DHE), a fluorescent 
probe for superoxide anion (O2−), showed that RasV12/Pdsw−/− 
Figure 1 | RasV12/mito−/− clones cause non-autonomous overgrowth 
through JNK-mediated Upd induction. a, b, Scanning electron microscopic 
(left), light level (middle) and fluorescent (right) images of adult eye bearing 
GFP-labelled wild-type (a) or RasV12/Pdsw−/− (b) clones. c-f, Eye-antennal 
disc bearing GFP-labelled wild-type (c), Pdsw−/− (d), RasV12 (e) or 
RasV12/Pdsw−/− (f) clones. g, 5-Bromodeoxyuridine (BrdU) labelling in eye-antennal 
disc bearing GFP-labelled RasV12/Pdsw−/− clones. Normal BrdU 
incorporation in the second mitotic wave is indicated (g, right panel, arrow). 


clones cooperatively induced O2− production (Fig. 2d-f). Notably, 
blocking oxidative stress by glutathione peroxidase (GPx) in mutant 
clones abrogated JNK activation (Fig. 2h, compare with Fig. 1p) as well 
as the non-autonomous overgrowth (Fig. 2i and Supplementary Fig. 9e, 
h, j). On the other hand, blocking JNK in RasV12/mito−/− clones had no 
effect on the oxidative stress (Fig. 2g and Supplementary Fig. 9c). These 
results indicate that oxidative stress is the upstream trigger for JNK 
activation in RasV12/mito−/− clones. 

Intriguingly, we found that RasV12/CoVa−/− clones did not produce 
O2−, despite the strong induction of oxidative stress (Supplementary 
Fig. 9b, d). This is consistent with previous observations that O2− was 
induced by defects in complex I Pdsw but not by defects in complex IV 
CoVa. These observations raised the possibility that RasV12/CoVa−/− 
clones produce H2O2, whereas RasV12/Pdsw−/− clones produce O2− 
(Supplementary Fig. 9m). Indeed, reducing O2− by superoxide dismutase 
(SOD) in RasV12/Pdsw−/− clones, but not in RasV12/CoVa−/− 
clones, suppressed non-autonomous overgrowth (Fig. 2j and 
Supplementary Fig. 9f, i, k). Furthermore, reducing H2O2 in RasV12/ 
CoVa−/− clones by Catalase suppressed the non-autonomous overgrowth 
(Supplementary Fig. 9g, l). Consistent with these results, 
another fluorescent reactive oxygen species indicator, H2DCFDA, 
which can detect H2O2, strongly labelled RasV12/CoVa−/− clones 
(Supplementary Fig. 9n-p). These data indicate that Ras activation 
in cells with impaired mitochondrial respiratory function triggers 

h-k, Eye of stat92E/+ (h), dome/+ (i), upd1-RNAi (expressed in clones) (j) or 
BskDN (expressed in clones) (k) flies bearing RasV12/Pdsw−/− clones. l-o, Eye-antennal 
disc bearing GFP-labelled Pdsw−/− (l), RasV12 (m), RasV12/Pdsw−/− 
(n) or RasV12/Pdsw−/− + BskDN (o) clones stained with anti-Upd. Arrowheads 
indicate representative clones upregulating Upd. p, GFP-labelled RasV12/ 
Pdsw−/− clones were induced in puc-LacZ/+ eye-antennal disc and stained 
with anti-β-gal. Arrowheads indicate representative clones activating JNK. 


oxidative stress through production of either O2− or H2O2, which 
activates JNK. 

Our genetic data establish that JNK activation in RasV12/mito−/− 
clones is necessary to cause the nag phenotype. However, JNK activation is 
not sufficient, as clones of cells solely activating JNK signalling (for 
example, by overexpression of the Drosophila tumour-necrosis factor 
Eiger) do not cause non-autonomous overgrowth (Fig. 3a), indicating 
that RasV12/mito−/− clones must induce an additional downstream 
effect(s) essential for the induction of nag. Interestingly, we found that 
co-activation of JNK and Ras signalling caused Upd induction 
(Fig. 3b), which is mediated through the Hippo pathway (see below). 
The Hippo pathway is an evolutionarily conserved kinase cascade that 
negatively regulates tissue growth by phosphorylating and inhibiting 
the transcriptional co-activator Yorkie (Yki). Inactivation of the Hippo 
pathway reduces Yki phosphorylation, which results in induction of 
its target genes. Recent studies have shown that upd is a target of 
Yki and that Yki activity is mediated by JNK signalling during 
intestinal regeneration. Interestingly, co-activation of JNK and 
Ras signalling upregulated Yki target genes such as diap1 and cyclin 
E, whereas neither JNK nor Ras activation alone did (Fig. 3d-f and 
Supplementary Fig. 10a-c). Similarly, the combination of Ras activation 
and mitochondrial dysfunction caused upregulation of Yki targets 
expanded (ex), diap1 and cyclin E (Fig. 3j-l and Supplementary Fig. 
10e-h, m). Furthermore, forced activation of the Hippo kinase cascade 




©2012 Macmillan Publishers Limited. All rights reserved 




Figure 2 | JNK signalling is activated by oxidative stress in RasV12/mito−/− 
clones. a-c, Red fluorescent protein (RFP)-labelled RasV12 (a), Pdsw−/− (b) or 
RasV12/Pdsw−/− (c) clones were induced in gstD-GFP/+ eye-antennal disc. 
d-g, Eye-antennal disc bearing GFP-labelled RasV12 (d), Pdsw−/− (e), RasV12/ 
Pdsw−/− (f) or RasV12/Pdsw−/− + BskDN (g) clones was incubated with DHE 
for 5 min. A slight increase in production of reactive oxygen species was 
observed when incubated for 10 min (data not shown). h, GFP-labelled 
RasV12/Pdsw−/− + GPx-1 clones were induced in puc-LacZ/+ eye-antennal 
disc and stained with anti-β-gal. i, j, Adult eye bearing RasV12/Pdsw−/− + GPx-1 
(i) or RasV12/Pdsw−/− + SOD1 (j) clones. 


by Warts (Wts, a Yki kinase) or Hippo (Hpo, a Wts kinase), which 
inhibits Yki activity, as well as reducing yki expression by yki RNA 
interference (RNAi), blocked the induction of upd, diap1 and cyclin E 
both in Ras/JNK-activating clones (Fig. 3c, g and Supplementary 
Fig. 10d) and RasV12/mito−/− clones (Fig. 3i, o and Supplementary 
Fig. 10i-l, n-p) and suppressed the non-autonomous overgrowth 
(Fig. 3p and Supplementary Fig. 11a, b, e, g, h). We also found that 
the expression of a secreted growth factor Wingless (Wg), another 
target of Yki, was notably upregulated in RasV12/mito−/− clones 
through Hippo signalling (Fig. 3m, o and Supplementary Fig. 10p). 
Furthermore, reducing wg expression in RasV12/mito−/− clones suppressed 
the non-autonomous overgrowth (Supplementary Fig. 11c, d, 
i, j). Notably, combined expression of upd-RNAi and wg-RNAi in 
RasV12/mito−/− clones almost completely suppressed the nag phenotype 
(Fig. 3q and Supplementary Fig. 11f), indicating that induction of 
these factors through Hippo signalling is responsible for the 
non-autonomous overgrowth. Blocking JNK signalling in RasV12/mito−/− 
clones abolished the upregulation of all Yki target genes (Supplementary 
Fig. 12a-d). These data establish that JNK activation in 
RasV12/mito−/− clones cooperates with Ras signalling to inactivate the 
Hippo pathway, leading to induction of Yki targets upd and wg, which 
causes overgrowth of surrounding tissue. 

In human cancers, oncogenic cooperation occurs in a context of 
clonal evolution where somatic mutations in mitochondrial DNA 
Figure 3 | Co-activation of JNK and Ras signalling causes upregulation of 
Upd and Wg through Hippo pathway inactivation. a-c, Eye-antennal disc 
bearing GFP-labelled Eiger (a), RasV12/Eiger (b) or RasV12/Eiger + Wts (c) was 
stained with anti-Upd. Arrowheads indicate representative clones upregulating 
Upd (b). d-g, GFP-labelled RasV12 (d), Eiger (e), RasV12/Eiger (f) or RasV12/ 
Eiger + Wts (g) clones were induced in diap1-lacZ/+ eye-antennal disc and 
stained with anti-β-gal. Endogenous diap1-lacZ expression is weakly observed 
in the region posterior to the morphogenetic furrow. h, i, GFP-labelled RasV12/ 
CoVa−/− (h) or RasV12/CoVa−/− + Wts (i) clones were induced in upd-lacZ/+ 
eye-antennal disc and stained with anti-β-gal. Endogenous upd-lacZ expression 
is indicated (h, i, bottom panels, asterisks). j-l, GFP-labelled CoVa−/− 
(j), RasV12 (k) or RasV12/CoVa−/− (l) clones were induced in expanded-lacZ/+ 
eye-antennal disc and stained with anti-β-gal. Endogenous expanded-lacZ 
expression is indicated (j-l, bottom panels, asterisks). m, o, Eye-antennal disc 
bearing GFP-labelled RasV12/CoVa−/− (m) or RasV12/CoVa−/− + Wts 
(o) clones was stained with anti-Wg. Endogenous expression domains of Wg 
are indicated (m, o, bottom panels, asterisks). p, q, Adult eye bearing RasV12/ 
CoVa−/− + Wts (p) or RasV12/Pdsw−/− + upd1-RNAi + wg-RNAi (q) clones. 


(mtDNA) or nuclear mitochondrial genes occur in a clone of cells 
within a developing tumour. Strikingly, when such a situation was 
genetically created by inducing RasV12/mito−/− clones surrounded 
by RasV12-expressing tissue, the tumour cells not only overproliferated 
but also exhibited invasive behaviour; the tumour cells migrated out of 
eye-antennal discs and invaded adjacent tissues such as brain hemispheres 
and ventral nerve cord (Fig. 4b, d and Supplementary Fig. 13c), 
whereas tissues entirely composed of RasV12 cells never displayed such 
invasive behaviour (Fig. 4a, c). The invading tumour cells were 
mostly RasV12 cells with normal mitochondrial function, as RasV12/ 
mito−/− clones surrounded by RasV12 clones grew very poorly (Supplementary 
Fig. 13a, b). The invasive behaviour was strongly blocked 
by reducing stat92E gene dosage (Fig. 4e, f). Thus, the invasive growth 
is most probably due to Upd-JAK/STAT signalling, although Wg 
may also contribute to the proliferation and/or invasion. Together, 
these observations show that clones of cells with impaired mitochon- 
drial function arising inside Ras-activated benign overgrowths pro- 
mote tumour progression of neighbouring Ras-activated tissue 
(Fig. 4g). 




Figure 4 | RasV12/mito−/− clones drive tumour progression of surrounding 
tumours. a, b, Lateral view of larval cephalic complex with eye-antennal discs 
entirely expressing RasV12 and GFP (a) or RasV12 and GFP simultaneously 
bearing GFP-labelled RasV12/Pdsw−/− clones (b). c, d, Dorsal view of dissected 
larval central nervous system from animal with eye-antennal discs entirely 
expressing RasV12 and RFP simultaneously bearing either RFP-labelled RasV12 
(c) or RFP-labelled RasV12/mRpL4−/− (d) clones. e, f, Dorsal view of dissected 
larval central nervous system from animal with eye-antennal discs entirely 
expressing RasV12 and GFP simultaneously bearing GFP-labelled RasV12/ 
Pdsw−/− clones. The stat92E gene is heterozygous in f. Arrowheads in 
b, d, e indicate eye-antennal cells invading the ventral nerve cord. g, A model 
for non-autonomous tumour progression by Ras activation and mitochondrial 
dysfunction. 
Although it has been extensively studied how Ras signalling stimulates 
cell-autonomous proliferation and thereby contributes to cancers, 
our findings uncover a non-cell-autonomous oncogenic role of Ras in 
cooperation with mitochondrial dysfunction. Intriguingly, a similar 
non-autonomous overgrowth is caused by co-expression of constitutively 
activated (CA) Hep (a JNK kinase) and Raf (a downstream effector 
of Ras). However, RafCA/HepCA clones cause massive cell death, and 
blocking cell death by p35 in HepCA clones is sufficient to cause 
non-autonomous overgrowth (data not shown), indicating that, although 
an interesting cooperation, the RafCA/HepCA-induced non-autonomous 
overgrowth could be simply caused by compensatory proliferation. 
Thus, the non-autonomous tumour progression mechanism is specifically 
triggered by mitochondrial dysfunction in Ras-activated cells. 
Mutations in mitochondrial respiratory complexes cause cell cycle 
arrest, which could explain why inactivation of the Hippo pathway 
in RasV12/mito−/− clones does not promote cell-autonomous proliferation. 
Intriguingly, we found that E-cadherin and Discs-large are downregulated 
in these clones (data not shown), raising the possibility that 
RasV12/mito−/− clones disrupt the epithelial integrity, which could 
further contribute to Hippo pathway inactivation. 

Mitochondrial number and oxidative phosphorylation activities are 
often downregulated in many cancers. The high rates of 
mitochondrial malfunction in cancers are caused by somatic mutations in 
mtDNA, which encodes components of mitochondrial respiratory 
complexes. Our genetic data in Drosophila not only provide a mechanistic 
insight into tumour progression driven by mitochondrial 
dysfunction but also support Warburg’s hypothesis that mitochondrial 
defects play an active role in the aetiology of cancer. Strikingly, 
mtDNA mutations were frequently identified in pancreatic cancers, a 
highly aggressive cancer that mostly bears (~90%) Ras mutations. 
Given the evolutionary conservation of the pathways examined, 
similar cooperative mechanisms could have a role in human cancers. 


METHODS SUMMARY 


Clones of mutant cells were generated in eye-antennal discs as described. Detailed 
genotypes of the experimental individuals are described in Supplementary Information. 
The following antibodies and dyes were used: anti-Upd (D. Harrison), anti-Cyclin E 
(T. Orr-Weaver and H. Richardson), anti-cleaved caspase-3 (Cell 
Signaling), anti-β-gal (Sigma), anti-Wg (DSHB), anti-cleaved PARP (Cell 
Signaling), anti-phospho-JNK G9 (Cell Signaling), anti-BrdU BMG 6H8 (Roche), 
DHE (Invitrogen) and CM-H2DCFDA (Invitrogen). 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Fly strains and generation of clones. Fluorescently labelled mitotic clones 
were produced in larval imaginal discs using the following strains: y,w, eyFLP1; 
Act>y+>Gal4, UAS-GFP; FRT82B, Tub-Gal80 (82B tester-1), y,w, eyFLP1; 
Act>y+>Gal4, UAS-myrRFP; FRT82B (82B RFP-tester), Tub-Gal80, FRT40A, 
UAS-RasV12; eyFLP6, Act>y+>Gal4, UAS-GFP (40A Ras tester-1), Tub-Gal80, 
FRT40A, UAS-RasV12; eyFLP6, Act>y+>Gal4, UAS-His2A-mRFP (40A 
Ras tester-2), Tub-Gal80, FRT40A; eyFLP6, Act>y+>Gal4, UAS-GFP (40A 
tester-1), eyFLP1; FRT40A; Act>y+>Gal4, UAS-GFP (40A tester-2), and 
FRT40A; eyFLP6, Act>y+>Gal4, UAS-His2A-mRFP (40A tester-3). Additional 
strains used were as follows: UAS-Eiger, CoVa‘"4 (a gift from U. Banerjee), 
gstD-GFP (a gift from D. Bohmann), UAS-BskDN (a gift from T. Adachi-Yamada), 
UAS-GPx-1 (a gift from F. Missirlis), UAS-CD8-PARP-Venus 
(a gift from Y. Hiromi), upd-lacZ (a gift from H. Sun), UAS-RasV12, 
UAS-Catalase, UAS-SOD1, UAS-SOD2, Stat92E°*", Df(3R)H-B79 and dome@??™ 
(Bloomington Stock Center), pdswkt0101, cG9140*" SOs CG11455008%, 
REeSP*!7°4, mRpL4!™*, mRpL13/CG10602"", mRpL24°, mRpL1 78”, 
Df(1)BSC352 and os‘? ues (Drosophila Genetic Resource Center), UAS-upd1- 
RNAi (National Institute of Genetics Stock Center) and UAS-wg-RNAi 
(National Institute of Genetics Stock Center and Vienna Drosophila RNAi 
Center). 


Histology. Larval tissues were stained with standard immunohistochemical proce- 
dures using rabbit anti-Upd polyclonal antibody (1:500) (a gift from D. Harrison), 
mouse (1:500), rat (1:20) and guinea-pig (1:1,000) anti-Cyclin E antibodies (gifts 
from T. Orr-Weaver and H. Richardson), rabbit anti-cleaved caspase-3 antibody 
(Cell Signaling, 1:200), mouse anti-β-galactosidase antibody (Sigma, 1:500), mouse 
anti-Wg antibody (Developmental Studies Hybridoma Bank, 1:500), rabbit anti- 
cleaved PARP antibody (Cell Signaling, 1:200), mouse anti-phospho-JNK mono- 
clonal antibody G9 (Cell Signaling, 1:100), mouse anti-BrdU monoclonal antibody 
BMG 6H8 (Roche, 1:100), and were mounted with 4’,6-diamidino-2-phenylindole 
(DAPI)-containing SlowFade Gold Antifade Reagent (Molecular Probes). For BrdU 
labelling, larvae were dissected in Schneider’s medium and incubated in the same 
medium with 10 mM BrdU for 5 min. After incubation with supplied buffer (Roche, 
5-Bromo-2’-deoxy-uridine Labelling and Detection Kit I) containing 50 U ml−1 
DNase I, imaginal discs were stained with anti-BrdU monoclonal antibody BMG 
6H8 (Roche, 1:100) and mounted as described above. Images were taken with a Zeiss 
LSM510, Zeiss LSM700 META or Leica SP5 confocal microscope. 

DHE staining. Larvae were dissected in PBS and incubated in PBS with 5 µM DHE 
(Invitrogen) for 5 min. Images were taken with a confocal microscope. 

CM-H2DCFDA staining. Larvae were dissected in PBS and incubated in 
Schneider's medium with 5 mM CM-H2DCFDA (Invitrogen) for 15 min. 
Images were taken with a confocal microscope. 
Black mamba venom peptides target acid-sensing 
ion channels to abolish pain 


Sylvie Diochot, Anne Baron, Miguel Salinas, Dominique Douguet, Sabine Scarzello, Anne-Sophie Dabert-Gay, 
Delphine Debayle, Valérie Friend, Abdelkrim Alloui, Michel Lazdunski & Eric Lingueglia 


Polypeptide toxins have played a central part in understanding 
physiological and physiopathological functions of ion channels. 
In the field of pain, they led to important advances in basic 
research and even to clinical applications. Acid-sensing ion 
channels (ASICs) are generally considered principal players in 
the pain pathway, including in humans. A snake toxin activating 
peripheral ASICs in nociceptive neurons has been recently shown 
to evoke pain. Here we show that a new class of three-finger 
peptides from another snake, the black mamba, is able to abolish 
pain through inhibition of ASICs expressed either in central or 
peripheral neurons. These peptides, which we call mambalgins, 
are not toxic in mice but show a potent analgesic effect upon central 
and peripheral injection that can be as strong as morphine. This 
effect is, however, resistant to naloxone, and mambalgins cause 
much less tolerance than morphine and no respiratory distress. 
Pharmacological inhibition by mambalgins combined with the 
use of knockdown and knockout animals indicates that blockade 
of heteromeric channels made of ASIC1a and ASIC2a subunits in 
central neurons and of ASIC1b-containing channels in nociceptors 
is involved in the analgesic effect of mambalgins. These findings 
identify new potential therapeutic targets for pain and introduce 
natural peptides that block them to produce a potent analgesia. 

In a screen to discover new blockers of ASIC channels from animal 
venoms, we identified the venom of black mamba (Dendroaspis polylepis 
polylepis) as a potent and reversible inhibitor of the ASIC1a 
channel expressed in Xenopus oocytes (Fig. 1a). After purification, 
two active fractions were collected (Supplementary Fig. 1a, b). A partial 
amino-acid sequence was used to clone by degenerate PCR the corresponding 
complementary DNA (cDNA) from lyophilized venom 
(Supplementary Fig. 1c). Two isopeptides were identified and named 
mambalgin-1 and mambalgin-2. They are composed of 57 amino acids 
with eight cysteine residues, and only differ by one residue at position 4 
(Supplementary Fig. 1d). 

Mambalgins belong to the family of three-finger toxins (Supplementary 
Fig. 1d, e). They have no sequence homology with either 
PcTx1 or APETx2, two toxins that we previously identified to target 
ASIC channels. A three-dimensional model of mambalgin-1 has 
been established from a pool of five templates with known structures 
(Supplementary Fig. 1e), which shows a triple-stranded and short 
double-stranded antiparallel β-sheets connecting loops II and III, 
and loop I, respectively, the three loops emerging from the core of 
the toxin like fingers from a palm (Fig. 1b). The model structure pre- 
sents a concave face commonly found in neurotoxins and is stabilized 
by four disulphide bonds with a pattern identical to that observed in the 
crystal structure template (Cys 1-Cys 3, Cys 2-Cys 4, Cys 5-Cys 6 and 
Cys 7-Cys 8; Fig. 1b and Supplementary Fig. 1f). Mambalgins show a 
strong positive electrostatic potential that may contribute to binding to 
negatively charged ASIC channels (Fig. 1c). 


Mambalgins have the unique property of being potent, rapid and 
reversible inhibitors of recombinant homomeric ASIC1a and heteromeric 
ASIC1a + ASIC2a or ASIC1a + ASIC2b channels, that is, all the 
ASIC channel subtypes expressed in the central nervous system, 
with a similar potency for both isopeptides and IC50 values (the toxin 
concentration half-maximally inhibiting the current) of 55 nM, 
246 nM and 61 nM, respectively (Fig. 1d). Mambalgins also inhibit 
human ASIC channels (Supplementary Fig. 2a). The peptides inhibit 
ASIC1b and ASIC1a + ASIC1b channels that are specific to sensory 
neurons with IC50 values of 192 nM and 72 nM, respectively (Fig. 1e). 
Mambalgins, which bind to the closed and/or inactivated state of the 
channels (Supplementary Fig. 2b), modify the affinity for protons 
(pH0.5act shifted from 6.35 ± 0.04 to 5.58 ± 0.02, and pH0.5inact shifted 
from 7.10 ± 0.01 to 7.17 ± 0.01, n = 4, P < 0.001 and n = 5, P < 0.05, 
respectively; Fig. 1f) and act as gating modifier toxins. They potently 
inhibit native ASIC currents in spinal cord, hippocampal and sensory 
neurons (Fig. 1g, h and Supplementary Fig. 3). In central spinal cord 
neurons, mambalgin-1 (674 nM) decreased ASIC current amplitude to 
13.0 ± 2.0% (n = 14) of the control (Fig. 1g) and reduced the excitability 
in response to acidic pH without unspecific effect on basal neuronal 
excitability (resting potential) or on the threshold and the shape of 
evoked or spontaneous action potentials and on spontaneous postsynaptic 
currents (Supplementary Fig. 4). Mambalgins had no effect 
on ASIC2a, ASIC3, ASIC1a + ASIC3 and ASIC1b + ASIC3 channels, 
as well as on TRPV1, P2X2, 5-HT3A, Nav1.8, Cav3.2 and Kv1.2 channels 
(Supplementary Figs 5 and 6). 
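
To make the IC50 values above concrete, the fraction of current remaining at a given toxin concentration can be estimated with the standard Hill equation. This is an illustrative sketch, not the fitting procedure used in the study: the function name is hypothetical, and a Hill coefficient of 1 is assumed by default.

```python
def fraction_remaining(conc_nm, ic50_nm, hill=1.0):
    """Fraction of control current left at toxin concentration `conc_nm`.

    Standard Hill inhibition model: 1 / (1 + (C / IC50) ** nH).
    """
    if conc_nm < 0 or ic50_nm <= 0:
        raise ValueError("concentration must be >= 0 and IC50 must be > 0")
    return 1.0 / (1.0 + (conc_nm / ic50_nm) ** hill)


# At the IC50 itself, half of the current remains by definition.
at_ic50 = fraction_remaining(55.0, 55.0)

# Predicted residual current at 674 nM for a channel with IC50 = 55 nM.
at_674 = fraction_remaining(674.0, 55.0)
```

Under these assumptions a channel with an IC50 of 55 nM would retain roughly 7.5% of its current at 674 nM. Native neurons express mixtures of channel subtypes, so such a single-component estimate is only a rough guide.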

Most three-finger toxins, such as α-neurotoxins that block nicotinic 
acetylcholine receptors, evoke neurotoxic effects in animals. This is 
not the case for mambalgins, which do not produce motor dysfunction 
(Supplementary Fig. 7), apathy, flaccid paralysis, convulsions or death 
upon central injections (intrathecal or intracerebroventricular) in 
mice, but instead induce analgesic effects against acute and inflammatory 
pain (Fig. 2) that can be as strong as morphine but resistant to 
naloxone, with much less tolerance (Fig. 3a) and no respiratory distress 
(Fig. 3b). In the tail-flick and paw-flick tests, intrathecal injection 
of mambalgin-1 (or mambalgin-2) increased the latency for the 
tail and paw withdrawal reflex from 8.8 ± 0.4 s and 8.0 ± 0.8 s to 
23.2 ± 1.3 s and 19.8 ± 1.6 s, respectively, 7 min after injection (n = 
15-22, P < 0.001) (Fig. 2a, b). The effects were completely lost in 
ASIC1a knockout mice (Fig. 2a, b), demonstrating the essential involvement 
of ASIC1a-containing channels. The key involvement of ASIC 
channels present in central neurons in the analgesic effect of mambalgins 
was confirmed using intracerebroventricular injections of the peptides 
(Supplementary Fig. 8). Mambalgin-1 also suppressed inflammatory 
heat hyperalgesia and produced a strong analgesia evaluated in the 
paw-flick test after intraplantar injection of carrageenan (Fig. 2c), and 
drastically decreased acute (phase I) and inflammatory (phase II) 
pain assessed in the formalin test (Fig. 2d), with a potency similar to 
Figure 1 | Mambalgins represent a new class of three-finger toxins targeting 
ASIC channels. a, Black mamba venom (0.1 mg ml−1) reversibly inhibits rat 
ASIC1a current expressed in Xenopus oocytes. b, Three-dimensional model of 
mambalgin-1 (disulphide bridges in yellow). c, Electrostatic properties of 
mambalgin-1 and human ASIC1a channel (on the basis of the three-dimensional 
structure of chicken ASIC1a) with positive and negative 
isosurfaces in blue and red, respectively. d, e, Inhibition of rat ASIC channels 
expressed in COS-7 cells (n = 4-15; peptides applied before the pH drop as in 
a). Hill coefficients of 0.7-1 suggest a 1:1 stoichiometry between mambalgins 
and channels. f, Effect of mambalgin-1 on the pH-dependent activation and 
inactivation of ASIC1a current recorded in Xenopus oocytes (protocol shown in 
inset). g, h, Inhibition of native ASIC currents in dorsal spinal cord neurons 
(g) and DRG neurons (h) by mambalgin-1 (674 nM) and PcTx1 (20 nM), 
n = 14-34. Right panels: effect of mambalgin-1 on neurons expressing no 
PcTx1-sensitive current. (Vh = −60 mV in a, d, e and −50 mV in 
f-h.) Mean ± s.e.m. 


morphine (Fig. 2c, d). These effects were not significantly decreased 
by naloxone. 

Mambalgins, unlike the spider peptide PcTx1 (refs 5, 14), inhibit 
not only homomeric ASIC1a channels but also heteromeric 
ASIC1a + ASIC2 channels, which are abundantly expressed in central 
neurons. This led us to analyse the participation of ASIC2 in 
central analgesia evoked by mambalgins. Intrathecal injections of short 
interfering RNA (siRNA) to silence either ASIC2 (both variants a and 
b) or ASIC2a (Supplementary Fig. 9) induced an analgesia that was 
partly (ASIC2) or fully (ASIC2a) resistant to naloxone (Fig. 3c). In the 
presence of naloxone, central injection of mambalgin-1 in these 
knockdown mice had a decreased effect (Fig. 3d), consistent with a 
contribution of ASIC2a to the pain-suppressing effect of the peptide. 
Compensation by homomeric ASIC1a channels, which are also 
blocked by mambalgin-1, as well as incomplete in vivo knockdown 
(Supplementary Fig. 9), account for the residual analgesic effect that is 
observed. 

Figure 2 | Intrathecal injections of mambalgin-1 exert potent 
naloxone-resistant and ASIC1a-dependent analgesia in mice. a, b, Effects on acute 
thermal pain (46 °C) determined using tail-immersion (a, n = 13–22) and 
paw-immersion (b, n = 5–15) tests showing a large increase in response latencies, 
which is lost in ASIC1a−/− mice. c, Effects on inflammatory hyperalgesia induced 
by intraplantar carrageenan (grey bar) (n = 8–16). Right panels in a–c: area 
under curve (AUC, s × min) calculated from each mouse. d, Effects on first 
(0–10 min) and second (10–45 min) phase of formalin-induced spontaneous 
pain behaviour (n = 11–20). Naloxone was injected subcutaneously 10 min 
before the peptide. Comparisons are versus vehicle unless specified. 
Mean ± s.e.m. ***P < 0.001; **P < 0.01; *P < 0.05; NS, P > 0.05. 

Figure 3 | The central analgesic effect of mambalgin-1 shows reduced 
tolerance compared with morphine, no respiratory depression and involves 
the ASIC2a subunit. a, Repeated intrathecal injections of mambalgin-1 induce 
less tolerance than morphine at concentrations giving the same analgesic 
efficacy (n = 10, comparison with vehicle (*) or morphine (#)). b, Mambalgin-1 
(i.t., intrathecal) induces no respiratory depression, unlike morphine (i.t. 
or i.p., intraperitoneal; 0.01 and 0.4 mg per mouse, respectively; 
n = 4–7, comparison with vehicle unless specified). c, Paw-flick latency before 
(control) and after treatment with non-specific siRNA (si-CTR, n = 24), siRNA 
against ASIC2a and ASIC2b (si-ASIC2a/2b, n = 27), or siRNA against ASIC2a 
(si-ASIC2a, n = 19) with or without naloxone. Comparison with si-CTR (#) or 
untreated control (*), unless specified. d, Paw-flick area under curve calculated 
after intrathecal injection of mambalgin-1 or vehicle in siRNA-treated mice 
(n = 5–27; protocol shown in inset above; s.c., subcutaneous). Comparison 
with mice injected with vehicle after treatment by si-CTR, unless specified. 
Mean ± s.e.m. *** or ###, P < 0.001; ** or ##, P < 0.01; *P < 0.05; NS, 
P > 0.05. 

Because mambalgins are able to target different ASIC channel subtypes 
expressed in nociceptors, we tested their peripheral effect after 
intraplantar injections. Mambalgin-1 (unlike PcTx1) has a significant 
analgesic effect on acute heat pain (Fig. 4a) and reverses or prevents 
inflammatory hyperalgesia (Fig. 4b, c). However, this effect is clearly 
different from the previously described effect of central (intrathecal) 
injection of the peptide because it is still present in ASIC1a knockout 
mice (Fig. 4a–c), contrary to the central effect (Fig. 2a, b). If ASIC1a is 
not involved, what, then, are the mechanisms that support the peripheral 
effect of mambalgins? ASIC1b is specifically expressed in nociceptors 
but its role in pain is not known. A functional expression of 
ASIC1b-containing channels in dorsal root ganglion (DRG) neurons is 
demonstrated by the effect of mambalgin-1, which blocks both ASIC1a 
and ASIC1b, and has more potent effects than PcTx1, which only blocks 
ASIC1a channels (Fig. 1h). Moreover, silencing of the ASIC1b subunit 
in nociceptors of ASIC1a knockout mice (that is, where only ASIC1b is 
present; Supplementary Fig. 9) mimicked the analgesic effect of peripheral 
injection of mambalgin-1 on both acute pain and inflammatory 
hyperalgesia, and largely decreased the effect of subsequent intraplantar 
injection of the peptide (Fig. 4d, e), supporting the specific participation 
of ASIC1b in the peripheral effect of mambalgin-1. 

Figure 4 | Intraplantar injections of mambalgin-1 evoke peripheral 
analgesic effects through ASIC1b-containing channels. a, Effects of 
mambalgin-1 on acute thermal pain (46 °C) in wild-type (WT) and ASIC1a−/− 
mice (n = 10–23). b, Effects of mambalgin-1 on carrageenan-induced 
hyperalgesia when injected before (open symbols) or 2 h after (filled symbols) 
carrageenan (n = 7–20). Right panels: area under curve (after carrageenan in 
b). Comparison with vehicle unless specified. c, Mambalgin-1 prevents 
inflammatory hyperalgesia when co-injected with carrageenan (n = 7–20, 
comparison with control). d, Paw-flick latency before (control) and after 
treatment with si-CTR (n = 13) or siRNA against ASIC1a and ASIC1b 
(si-ASIC1a/1b, n = 10) in ASIC1a−/− mice under normal and inflammatory 
conditions. Comparison with si-CTR (#) or untreated control (*), unless 
specified. e, Paw-flick area under curve calculated after injection of mambalgin-1 
or vehicle in siRNA-treated ASIC1a−/− mice (n = 7–11). Comparison with 
mice injected with vehicle after treatment by si-CTR, unless specified. 
Mean ± s.e.m. *** or ###, P < 0.001; ** or ##, P < 0.01. 

Our results indicate that mambalgins have analgesic effects by 
targeting both primary nociceptors and central neurons, but through 
different ASIC subtypes (Supplementary Fig. 12). After demonstrating 
the important role of ASIC3 in peripheral pain and sensory perception 
in the skin, we now show that ASIC1b, but not ASIC1a, is important 
for cutaneous nociception and inflammatory hyperalgesia. In the 
central nervous system, injections of mambalgins evoke a 
naloxone-insensitive analgesia through an opioid-independent pain pathway 
involving ASIC1a + ASIC2a channels. Central injections of PcTx1, 
instead, evoke a naloxone-sensitive analgesia through its specific action 
on homomeric ASIC1a, and probably heteromeric ASIC1a + ASIC2b 
channels (Fig. 3c and ref. 22). In addition, mambalgins, unlike PcTx1 
(ref. 5), maintain a potent analgesia in mice deficient for the 
preproenkephalin gene (Supplementary Fig. 10). These results taken together 
indicate that different pathways involving different subtypes of 
ASIC channels can lead to different types of central analgesia 
(opioid-sensitive or insensitive) (Supplementary Fig. 12). They also indicate that 
despite their capacity in vitro to inhibit homomeric ASIC1a and 
heteromeric ASIC1a + ASIC2b channels, in vivo, mambalgin central analgesic 
action is mainly targeted to neurons expressing ASIC1a + ASIC2a 
channels (Supplementary Fig. 11). 

It is essential to understand pain better to develop new analgesics 
(refs 25, 26). The black mamba peptides discovered here have the potential to 
address both of these aims. They show a potent role for different 
ASIC channel subtypes in both the central and peripheral pain pathways, 
providing promising new targets for therapeutic interventions 
against pain, and they are themselves powerful, naturally occurring, 
analgesic peptides of potential therapeutic value. 


METHODS SUMMARY 


Mambalgins were purified from Dendroaspis polylepis polylepis venom (Latoxan) 
using cation exchange and reverse phase chromatography steps. The molecular 
mass and peptide sequence were determined by Edman degradation and/or 
tandem mass spectrometry sequencing, and used to design primers for cloning 
the full-length mambalgin-1 cDNA from venom. The three-dimensional structure 
was modelled from five templates of three-finger snake toxins. Recordings of 
recombinant ASIC currents were done after expression in Xenopus laevis oocytes 
and COS-7 cells, and patch-clamp recordings of native ASIC currents were 
obtained from primary cultures of mouse dorsal spinal cord neurons, hippocampal 
neurons and rat DRG neurons. Pain behaviour experiments were performed in 
C57BL/6J mice after intrathecal (or intracerebroventricular) injection of 
mambalgins (10 µl at 34 µM), PcTx1 (10 µl at 10 µM) or morphine (Cooper, 10 µl at 
3.1 mM), in the absence or in the presence of naloxone (Fluka, 2 mg kg⁻¹), and 
after intraplantar injection of mambalgins (10 µl at 34 µM) and PcTx1 (10 µl 
at 10 µM). The effects of mambalgins on acute pain were also tested in 
ASIC1a−/− (ref. 27) and Penk1−/− (ref. 28) mice. Inflammation was evoked by 
intraplantar 2% carrageenan (Sigma) or 2% formalin. In vivo ASIC1 and ASIC2 
gene silencing experiments were performed by repeated intrathecal injections (2 µg 
per mouse twice a day for 3 days) of siRNAs targeting ASIC1a + b, ASIC2a + b or 
only ASIC2a, mixed with i-Fect (Neuromics) as previously described. 


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS 


Electrophysiology in Xenopus laevis oocytes. Venom fractions were tested on 
rat ASIC1a expressed in Xenopus oocytes as previously described, applied 30 s 
before the acid stimulation. 
Purification, peptide sequencing and mass spectrometry. The venom of the 
black mamba Dendroaspis polylepis polylepis (Latoxan) was purified by gel 
filtration and cation exchange. The active fraction was loaded on a reversed-phase 
column (C18 ODS, Beckman) and eluted with a linear gradient of acetonitrile 
containing 0.1% TFA. Molecular mass and peptide sequence were determined by 
matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF)/TOF-MS 
(Applied Biosystems). Protein identification was performed with Mascot 
(http://www.matrixscience.com) at 50 p.p.m. mass tolerance against NCBI (non-redundant) 
and Swiss-Prot databases. Data were analysed using the GRAMS386 software. 
Partial sequence was obtained by amino (N)-terminal Edman degradation and 
protease digestion (V8 protease and trypsin) followed by tandem mass spectro- 
metry sequencing. Peptide analysis was performed using a nano-high-performance 
liquid chromatography offline (Dionex, U3000) coupled with a 4800 MALDI-TOF/ 
TOF mass spectrometer. 
Cloning of the mambalgin-1 cDNA. Mambalgin-1 cDNA was cloned from the 
black mamba venom. Lyophilized venom (Sigma) was reconstituted in lysis/ 
binding buffer and polyadenylated mRNAs were captured on oligo(dT25) 
magnetic beads (Dynal). After first-strand cDNA synthesis, PCR amplification was 
done with degenerate sense (TGITTYCARCAYGGIAARGT) and antisense 
(YTTIARRTTICGRAAIGGCAT) primers designed from the partial peptide 
sequence obtained from biochemical purification. A specific sense primer 
(ACACGAATTCGCTATCATAACACTGGCATG) was designed from the new sequence 
and used with a non-specific poly-dT30 antisense primer (ACACGAATTCdT30) 
to amplify the 3'-coding and non-coding sequences of mambalgin-1. Using the very 
strong conservation of the 3'- and 5'-non-coding sequences among snake toxins, 
we designed a sense (ACACGAATTCTCCAGAGAAGATCGCAAGATG) 
and an antisense (ACACGAATTC-ATTTAGCCACTCGTAGAGCTA) primer 
to amplify the complete open reading frame of the toxin precursor. 
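
The degenerate primers above are written with IUPAC ambiguity codes (Y = C or T, R = A or G), with I denoting inosine. As a minimal sketch, and not part of the published protocol, the following Python function counts how many distinct sequences each degenerate primer represents. Treating inosine as a single base is an assumption of this sketch, and the names `IUPAC_COUNTS` and `degeneracy` are hypothetical.

```python
# Number of concrete bases each IUPAC symbol can stand for.
# "I" (inosine) is counted as one base: it is a single nucleoside that
# pairs broadly. This treatment is an assumption of the sketch.
IUPAC_COUNTS = {
    "A": 1, "C": 1, "G": 1, "T": 1, "I": 1,
    "R": 2, "Y": 2, "S": 2, "W": 2, "K": 2, "M": 2,
    "B": 3, "D": 3, "H": 3, "V": 3, "N": 4,
}


def degeneracy(primer: str) -> int:
    """Return the number of distinct sequences encoded by a degenerate primer."""
    total = 1
    for base in primer.upper():
        total *= IUPAC_COUNTS[base]
    return total


# Primer sequences copied from the text above.
SENSE = "TGITTYCARCAYGGIAARGT"
ANTISENSE = "YTTIARRTTICGRAAIGGCAT"

if __name__ == "__main__":
    print("sense degeneracy:", degeneracy(SENSE))
    print("antisense degeneracy:", degeneracy(ANTISENSE))
```

Each primer contains four two-fold ambiguous positions, so under this counting each stands for 16 sequences.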
Template-based three-dimensional modelling of mambalgin-1. We modelled 
the mambalgin-1 protein using the semi-automatic pipeline of the webserver 
@TOME version 2.1 (ref. 33). The amino-acid sequence was submitted to the 
server to perform the fold recognition and detect structural homologue templates 
from the Protein Data Bank. Active fold-recognition tools were HHSEARCH, 
SP3, PsiBlast and Fugue. Five templates were selected among snake venom 
toxins with four disulphide bonds and aligned with Muscle. The homology 
modelling of mambalgin-1 was performed with Modeller 9v8. The overall quality 
of models was estimated by calculating the one- and three-dimensional 
compatibility TITO score, by analysing the Ramachandran plot with MolProbity and 
comparing it with scores of the templates, and by visual inspection. 
Electrostatic potential calculation. Electrostatic properties of mambalgin-1 
(isosurfaces of $+3\,k_{\mathrm{B}}T/e_{\mathrm{c}}$ (≈ +77 mV) and $-3\,k_{\mathrm{B}}T/e_{\mathrm{c}}$ (≈ −77 mV)) and human 
ASIC1a channel (isosurfaces of $+10\,k_{\mathrm{B}}T/e_{\mathrm{c}}$ (≈ +256 mV) and $-10\,k_{\mathrm{B}}T/e_{\mathrm{c}}$ 
(≈ −256 mV)) have been calculated with the Adaptive Poisson-Boltzmann Solver. 
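
The millivolt values quoted for the isosurfaces follow from converting multiples of the thermal voltage, k_B T divided by the elementary charge, into volts. The short Python sketch below illustrates that conversion; the temperature of 298.15 K is an assumption (the text does not state the temperature used), and the function name is hypothetical.

```python
K_B = 1.380649e-23          # Boltzmann constant, J/K (exact SI value)
E_CHARGE = 1.602176634e-19  # elementary charge, C (exact SI value)


def kt_over_e_to_mv(multiple: float, temperature_k: float = 298.15) -> float:
    """Convert a multiple of k_B*T/e into millivolts.

    temperature_k defaults to 298.15 K, which is an assumption of this
    sketch rather than a value given in the Methods.
    """
    return multiple * K_B * temperature_k / E_CHARGE * 1e3


if __name__ == "__main__":
    for m in (3, 10):
        print(f"{m} kT/e = {kt_over_e_to_mv(m):.1f} mV")
```

At the assumed temperature, 3 k_B T/e comes out close to 77 mV and 10 k_B T/e close to 257 mV, in line with the rounded values in the text.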
Electrophysiology in COS cells and neurons. COS-7 cells were transfected with 
pCI-ASICs mixed with pIRES2-EGFP and jet-PEI. Primary cultures of dorsal spinal 
neurons were obtained from C57BL/6J mouse embryos (embryonic day (E)14). 
Primary cultured hippocampal neurons were prepared from C57BL/6J mice 
(P3–P5) as previously described for rats. Primary cultured sensory neurons were 
prepared from dorsal root ganglia of Wistar rats (5–7 weeks) as previously described. 
Data were recorded in the whole-cell configuration, sampled at 3.3 kHz and 
low-pass filtered at 3 kHz using pClamp8 software (Axon Instruments). The pipette 
solution was (in mM): KCl 140, NaCl 5, MgCl₂ 2, EGTA 5, HEPES-KOH 10 (pH 
7.4); the bath solution was (in mM): NaCl 140, KCl 5, MgCl₂ 2, CaCl₂ 2, 
HEPES-NaOH 10 (pH 7.4). MES was used instead of HEPES for pH from 6 to 5. The bath 
solution for neurons was supplemented with 10 mM glucose, and 20 µM CNQX/ 
10 µM kynurenic acid for central neurons. The pipette solution for neurons 
contained (in mM): KCl 140, ATP-Na₂ 2.5, MgCl₂ 2, CaCl₂ 2, EGTA 5, HEPES 10 
(pH 7.3, pCa estimated to 7). Toxins were perfused at pH 7.4 with bovine serum 
albumin (0.05%) to prevent non-specific adsorption. Concentration–response curves were 
fitted by the Hill equation: $I = I_{\max} + (I_{\min} - I_{\max})\,\dfrac{C^{n_H}}{C^{n_H} + IC_{50}^{\,n_H}}$, where $I$ is 
the amplitude of relative current, $C$ is the toxin concentration, $IC_{50}$ is the toxin 
concentration half-maximally inhibiting the current, and $n_H$ is the Hill coefficient. 
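
As an illustration of the concentration–response relation described above, the following Python sketch evaluates the Hill equation for an inhibitory toxin. The IC50 value of 50 nM and the Hill coefficient used in the example are placeholders chosen for illustration, not values reported here, and the function name is hypothetical.

```python
def hill_inhibition(c: float, ic50: float, n_h: float,
                    i_max: float = 1.0, i_min: float = 0.0) -> float:
    """Relative current remaining at toxin concentration c.

    Implements I = I_max + (I_min - I_max) * C^nH / (C^nH + IC50^nH).
    c and ic50 must share the same unit; n_h is the Hill coefficient.
    """
    occupancy = c ** n_h / (c ** n_h + ic50 ** n_h)
    return i_max + (i_min - i_max) * occupancy


if __name__ == "__main__":
    # Hypothetical parameters, for illustration only.
    ic50_nm, n_h = 50.0, 0.9
    for conc in (0.0, 5.0, 50.0, 500.0):
        print(f"{conc:7.1f} nM -> I/Imax = {hill_inhibition(conc, ic50_nm, n_h):.3f}")
```

By construction the relative current equals I_max with no toxin, falls to the midpoint between I_max and I_min at C = IC50, and approaches I_min at saturating concentrations.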
Plethysmography. Respiratory frequency (breaths per minute) was recorded from 
7 to 67 min after intrathecal injection of vehicle, mambalgin-1 or morphine-HCl 
(following the same protocol as for pain behaviour) or intraperitoneal injection 
of morphine-HCl (24.8 mM, 50 µl) with a whole-body plethysmograph (Emka 
Technologies). 
Pain behaviour experiments. Experiments were performed on awake 7- to 
11-week-old (20–25 g) male C57BL/6J, ASIC1a−/− and Penk1−/− mice following 
the guidelines of the International Association for the Study of Pain and were 
approved by the local ethics committee (agreements NCA/2007/04-01 and 
NCE/2011-06). Mambalgin-1 (34 µM), mambalgin-2 (20 µM), PcTx1 (10 µM) and 
morphine-HCl (3.1 mM; Cooper) dissolved in vehicle solution (in mM: NaCl 145, 
KCl 5, MgCl₂ 2, CaCl₂ 2, HEPES 10, pH 7.4, 0.05% BSA for intrathecal injection, and 
NaCl 154, 0.05% BSA for intraplantar injection) were injected intrathecally (10 µl) 
between spinal L5 and L6 segments or intraplantarly (10 µl). Naloxone (Fluka, 
2 mg kg⁻¹ in NaCl 0.9%, 50 µl) was injected subcutaneously (dorsal injection) 
10 min before intrathecal injection. Inflammation was evoked by intraplantar 
injection in the left hindpaw of 2% carrageenan (Sigma-Aldrich) (20 µl) 2 h before 
intrathecal or intraplantar injection of peptides, morphine or vehicle. 
Knockdown experiments. Locally designed siRNAs targeting ASIC1 
(si-ASIC1a/1b, GCCAAGAAGUUCAACAAAUadtdt), ASIC2 (si-ASIC2a/2b, 
UGAUCAAAGAGAAGCUAUUdtdt) and ASIC2a (si-ASIC2a, 
AGGCCAACUUCAAACACUAdtdt) have been validated in vitro in COS-7 cells transfected with 
myc-ASIC1a, ASIC1b, myc-ASIC2a or myc-ASIC2b, and the relevant siRNA or a 
control siRNA (si-CTR, GCUCACACUACGCAGAGAUdtdt) with TransIT-LT1 
and TransIT-TKO (Mirus), respectively. Cells were lysed 48 h after transfection and 
processed for western blot analysis to assess the amount of ASIC1a protein with 
the anti-Myc A14 antibody (1:500; Santa Cruz Biotechnology) or the anti-ASIC1 
antibody (N271/44; 1:300; NeuroMab) and a monoclonal antibody against actin 
(AC-40; 1:1,000; Sigma) as a loading control. siRNAs were intrathecally injected into 
mice (2 µg per mouse at a ratio of 1:4 (w/v) with i-Fect (Neuromics)) twice a day for 
3 days. After 3 days of treatment, the paw-flick latency was measured and the 
residual effect of mambalgin-1 (intrathecal or intraplantar, 34 µM) or the carrageenan 
(intraplantar, 2%)-induced hyperalgesia was tested. For validation of the in vivo 
effect of the siRNAs, lumbar DRGs or lumbar dorsal spinal cord were removed 
after the last siRNA injection for total RNA isolation (RNeasy kits, Qiagen) fol- 
lowed by cDNA synthesis (AMV First-Strand cDNA synthesis kit (Invitrogen) and 
High Capacity RNA-to-cDNA Kit, (Applied Biosystems)). The relative amounts of 
ASIC transcripts were evaluated by quantitative reverse-transcription PCR in a 
Light-Cycler 480 (Roche Products). Pre-designed and validated TaqMan assays for 
ASIC1 (ASICla and ASIC1b; Mm01305998_mH), ASIC1a (Mm01305996_m1), 
ASIC2 (ASIC2a and ASIC2b; Mm00475691_m1), ASIC3 (Mm00805460_m1) and 
18S ribosomal RNA (Mm03928990_g1) were from Applied Biosystems. Each 
cDNA sample was run in triplicate and results were normalized against 18S and 
converted to fold induction relative to control siRNA treatment. 

Data analysis. Data were analysed with Microcal Origin 6.0 and GraphPad Prism 
4. Areas under the time course curves (response latency in seconds × time after 
injection in minutes) were calculated for each mouse (over the first 37 min for 
tail-flick and the entire time range for paw-flick) and expressed as mean ± s.e.m. After 
testing the normality of data distribution, the statistical difference between two 
different experimental groups was analysed by unpaired Student's t-test, and 
between more than two different experimental groups by a two-way analysis of 
variance followed by a Newman-Keuls multiple comparison test when P < 0.05. 
For data in the same experimental group, a paired Student's t-test was used. *** or 
###, P < 0.001; ** or ##, P < 0.01; *P < 0.05; NS, P > 0.05. 
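
The per-mouse area under the latency time course and the group summary described above can be sketched in a few lines of Python. The trapezoidal rule is an assumption of this sketch (the text does not say how the area was integrated), the example numbers are invented for illustration, and the function names are hypothetical.

```python
from math import sqrt
from statistics import mean, stdev


def auc_trapezoid(times_min, latencies_s):
    """Area under a latency-versus-time curve, in s x min (trapezoidal rule)."""
    points = list(zip(times_min, latencies_s))
    return sum((t1 - t0) * (y0 + y1) / 2.0
               for (t0, y0), (t1, y1) in zip(points, points[1:]))


def mean_sem(values):
    """Return (mean, standard error of the mean) for a group of mice."""
    return mean(values), stdev(values) / sqrt(len(values))


if __name__ == "__main__":
    # Invented example: three mice sampled at the same time points (min).
    times = [0, 7, 17, 27, 37]
    mice = [
        [8.5, 22.0, 18.0, 12.0, 9.0],
        [9.0, 24.0, 20.0, 13.0, 9.5],
        [8.0, 21.0, 17.0, 11.0, 8.5],
    ]
    aucs = [auc_trapezoid(times, latencies) for latencies in mice]
    m, sem = mean_sem(aucs)
    print(f"AUC = {m:.1f} ± {sem:.1f} s x min (n = {len(aucs)})")
```

Computing one area per animal first, and only then averaging, keeps the s.e.m. a measure of between-mouse variability, which matches the description of areas "calculated for each mouse".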
Structure-based prediction of protein-protein 
interactions on a genome-wide scale 

The genome-wide identification of pairs of interacting proteins is an 
important step in the elucidation of cell regulatory mechanisms. 
Much of our present knowledge derives from high-throughput 
techniques such as the yeast two-hybrid assay and affinity purification, 
as well as from manual curation of experiments on individual 
systems. A variety of computational approaches based, for example, on 
sequence homology, gene co-expression and phylogenetic profiles, 
have also been developed for the genome-wide inference of 
protein-protein interactions (PPIs). Yet comparative studies suggest that 
the development of accurate and complete repertoires of PPIs is 
still in its early stages. Here we show that three-dimensional 
structural information can be used to predict PPIs with an accuracy 
and coverage that are superior to predictions based on non-structural 
evidence. Moreover, an algorithm, termed PrePPI, which combines 
structural information with other functional clues, is comparable in 
accuracy to high-throughput experiments, yielding over 30,000 high- 
confidence interactions for yeast and over 300,000 for human. 
Experimental tests of a number of predictions demonstrate the ability 
of the PrePPI algorithm to identify unexpected PPIs of considerable 
biological interest. The surprising effectiveness of three-dimensional 
structural information can be attributed to the use of homology 
models combined with the exploitation of both close and remote 
geometric relationships between proteins. 

Until now, structural information has had relatively little impact in 
constructing protein-protein interactomes, primarily because there is 
a marked difference between the number of proteins with known 
sequences and those with an experimentally known structure. For 
example, as of early 2010, the Protein Data Bank (PDB) provided 
structures for ~600 of the total complement of ~6,500 yeast proteins 
(~10%), whereas the structural coverage of protein-protein complexes 
is even more sparse, with only about 300 structures available out of the 
approximately 75,000 PPIs (<0.5%) recorded in publicly available 
databases. However, ~3,600 additional yeast proteins have homology 
models in either the ModBase or SkyBase databases. Moreover, there 
were about 37,000 protein-protein complexes derived from multiple 
organisms in the PDB and Protein Quaternary Structure (PQS) 
databases, which might be used as 'templates' to model PPIs. Clearly, if 
structure is to be useful on a large scale, it is essential that modelling 
of individual proteins and of complexes be exploited. 

A number of studies have used structurally characterized complexes 
as templates to construct models of the complexes that might be 
formed between proteins that have been classified as having sequence 
and/or structural relationships to the proteins in the template. We 
searched more broadly for templates, using geometric relationships 
between groups of secondary structure elements as revealed by struc- 
tural alignment, independently of how they are classified. It has been 
demonstrated that even distantly related proteins often use regions of 
their surface with similar arrangements of secondary structure elements 
to bind to other proteins, suggesting the possibility of considerably 
expanding the number of putative PPIs that can be identified. It is 
likely that further expansion can be achieved if interactions involving 
unstructured regions of proteins are taken into account, but these are 
not considered in this work. 

Our approach to the prediction of PPIs is embodied in an algorithm 
we have named PrePPI (predicting protein-protein interactions), 
which combines structural and non-structural interaction clues using 
Bayesian statistics (see Fig. 1 and Methods for details). The structural 
component of PrePPI involves a number of steps. Briefly, given a pair 
of query proteins (QA and QB), we first use sequence alignment to 
identify structural representatives (MA and MB) that correspond to 
either their experimentally determined structures or to homology 
models. We then use structural alignment to find both close and remote 
structural neighbours (NAᵢ and NBⱼ) of MA and MB (an average of 
~1,500 neighbours are found for each structure). Whenever two (for 
example, NA₁ and NB₃) of the over 2 million pairs of neighbours of MA 
and MB form a complex reported in the PDB, this defines a template for 
modelling the interaction of QA and QB. Models of the complex are 
created by superimposing the representative structures on their 
corresponding structural neighbours in the template (that is, MA on NA₁ and 
MB on NB₃). This procedure produces about 550 million 'interaction 
models' for about 2.4 million PPIs involving about 3,900 yeast proteins, 
and about 12 billion models for about 36 million PPIs involving about 
13,000 human proteins. Note that an interaction model is based on 
structure-based sequence alignments of query proteins to their 
individual templates (Supplementary Fig. 1) and that we do not construct a 
three-dimensional model of each complex because the scoring of so 
many individual complexes would be prohibitively time consuming 
using standard energy functions (for example, as used in docking). 
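
The template search described in this paragraph amounts to checking which pairs of structural neighbours are already known to form a complex. The Python sketch below illustrates the idea on toy identifiers; it is not the PrePPI implementation, and the function name, the identifiers and the representation of known complexes as a set of chain pairs are all assumptions made for illustration.

```python
from itertools import product


def find_templates(neighbours_a, neighbours_b, known_complexes):
    """Return neighbour pairs (na, nb) that form a known complex.

    neighbours_a / neighbours_b: structural neighbours of MA and MB.
    known_complexes: set of chain-identifier pairs observed in a
    structural database (order-insensitive lookup is done here).
    """
    return [
        (na, nb)
        for na, nb in product(neighbours_a, neighbours_b)
        if (na, nb) in known_complexes or (nb, na) in known_complexes
    ]


if __name__ == "__main__":
    # Toy identifiers, for illustration only.
    neighbours_of_ma = ["NA1", "NA2"]
    neighbours_of_mb = ["NB1", "NB2", "NB3"]
    complexes = {("NA1", "NB3"), ("NB2", "NA2")}
    for template in find_templates(neighbours_of_ma, neighbours_of_mb, complexes):
        print("template:", template)
```

With roughly 1,500 neighbours per structure, the full product is on the order of 2 million pairs per query pair, which is why a cheap set lookup rather than any per-pair modelling is used in this sketch.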

Once an interaction model has been created, it is evaluated using a 
combination of five empirical scores that measure properties derived 
from alignments of the individual monomers to their templates 
(Supplementary Fig. 1). The first score, SIM, depends on the structural 
similarity between models of the two query proteins (that is, MA and 
MB) and those in the template complex (that is, NA₁ and NB₃). The 
next two scores determine whether the interface in the template 
complex actually exists in the model. They are calculated as SIZ, the 
number, and COV, the fraction, of interacting residue pairs in the template 
(for example, NA₁–NB₃) that align to some pair of residues in the 
model (MA-MB). The final two scores reflect whether the residues that 
appear in the model interface have properties consistent with those that 
mediate known PPIs (for example, residue type, evolutionary conser- 
vation, or statistical propensity to be in protein-protein interfaces). This 
information is obtained from three publicly available servers that 
predict interfacial residues based on the sequence and structure of the 

Figure 1 | Predicting protein-protein interactions using PrePPI. Given a 
pair of query proteins that potentially interact (QA, QB), representative 
structures for the individual subunits (MA, MB) are taken from the PDB, where 
available, or from homology model databases. For each subunit we find both 
close and remote structural neighbours. A 'template' for the interaction exists 
whenever a PDB or PQS structure contains a pair of interacting chains (for 
example, NA₁–NB₃) that are structural neighbours of MA and MB, 
respectively. A model is constructed by superposing the individual subunits, 

individual subunits of the model. These scores are calculated as OS, 
which is identical to SIZ but with the additional requirement that both 
residues in an interacting pair of the template align to predicted inter- 
facial residues in MA and MB, and OL, the number of template inter- 
facial residues that align to predicted interfacial residues in MA and MB. 
We note that although the interaction models produced by our pro- 
cedure can reveal the approximate locations of potential interfaces, 
they will not, in general, be accurate at atomic resolution. 
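
The four interface scores defined above (SIZ, COV, OS and OL) can be illustrated with a small Python sketch that works directly on residue alignments. This is one reading of the verbal definitions, not the PrePPI code: the data layout (dictionaries mapping template residues to model residues, and sets of predicted interfacial residues) and all names and numbers are hypothetical.

```python
def interface_scores(template_pairs, align_a, align_b, predicted_a, predicted_b):
    """Compute SIZ, COV, OS and OL for one interaction model.

    template_pairs: interacting residue pairs (ra, rb) in the template complex.
    align_a / align_b: template residue -> aligned model residue (MA / MB);
        unaligned template residues are simply absent from the dict.
    predicted_a / predicted_b: model residues predicted to be interfacial.
    """
    aligned = [(align_a[ra], align_b[rb])
               for ra, rb in template_pairs
               if ra in align_a and rb in align_b]
    siz = len(aligned)
    cov = siz / len(template_pairs) if template_pairs else 0.0
    # OS: like SIZ, but both aligned residues must be predicted interfacial.
    os_score = sum(1 for ma, mb in aligned
                   if ma in predicted_a and mb in predicted_b)
    # OL: template interfacial residues aligned to predicted interfacial residues.
    iface_a = {ra for ra, _ in template_pairs}
    iface_b = {rb for _, rb in template_pairs}
    ol = (sum(1 for r in iface_a if align_a.get(r) in predicted_a)
          + sum(1 for r in iface_b if align_b.get(r) in predicted_b))
    return {"SIZ": siz, "COV": cov, "OS": os_score, "OL": ol}


if __name__ == "__main__":
    # Invented residue numbers, for illustration only.
    pairs = [(10, 55), (11, 55), (12, 60), (13, 61)]
    scores = interface_scores(
        pairs,
        align_a={10: 101, 11: 102, 12: 103},
        align_b={55: 201, 60: 202, 61: 203},
        predicted_a={101, 103},
        predicted_b={201},
    )
    print(scores)
```

Because only alignments are needed, scores of this kind can be computed without building a three-dimensional model of the complex, consistent with the approach described in the text.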

The five empirical scores are combined using a Bayesian network 
(Supplementary Fig. 2) to yield a likelihood ratio (LR) that a candidate 
protein-protein complex represents a true interaction (see Methods). 
The network is trained on positive and negative 'gold standard' 
reference data sets. Similar to two recent studies, we combine 
interaction data from multiple databases to ensure a broad coverage of true 
interactions. We divide these sets into high-confidence (HC) and low- 
confidence (LC) subsets (Supplementary Table 1); the HC sets contain 
11,851 yeast interactions and 7,409 human interactions that have more 
than one publication supporting their existence; interactions with only 
one supporting publication compose the LC set. All potential PPIs in a 
given genome not in the HC plus LC set form the negative (N) ref- 
erence set. Using the Bayesian network classifier trained on the yeast 
HC set, we select the best interaction model with the highest LR for 
each PPI. 

To assess quantitatively the performance of structural modelling 
(SM), we compared it with a number of non-structural clues 
previously used to infer PPIs: (1) essentiality of the proteins in the 
interacting pair; (2) co-expression level; (3) gene ontology (GO) 
functional similarity; (4) Munich Information Centre for Protein 
Sequences (MIPS) functional similarity; and (5) phylogenetic profile 
similarity. We used the same algorithms or data for clues 1–4 as 
previously described but developed our own phylogenetic profile 
algorithm (for details, see Methods and Supplementary Table 2). Briefly, 
a phylogenetic profile was constructed for every protein using a set of 
completely resolved proteomes as references. Because interacting 
proteins tend to co-evolve, proteins with similar profiles are predicted 
to interact. 
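
To make the idea of a phylogenetic profile concrete, the Python sketch below builds presence/absence profiles over a set of reference proteomes and compares two profiles. The algorithm actually used is described in Methods and Supplementary Table 2 and is not reproduced here; the Jaccard index used below is a generic stand-in chosen for illustration, and the species labels and function names are hypothetical.

```python
def phylogenetic_profile(species_with_ortholog, reference_proteomes):
    """Binary profile: 1 if the protein has an orthologue in a proteome, else 0."""
    return tuple(1 if sp in species_with_ortholog else 0
                 for sp in reference_proteomes)


def profile_similarity(profile_a, profile_b):
    """Jaccard index of two binary profiles (an assumption of this sketch)."""
    both = sum(1 for a, b in zip(profile_a, profile_b) if a and b)
    either = sum(1 for a, b in zip(profile_a, profile_b) if a or b)
    return both / either if either else 0.0


if __name__ == "__main__":
    # Hypothetical reference proteomes and orthologue assignments.
    references = ["sp1", "sp2", "sp3", "sp4"]
    prof_x = phylogenetic_profile({"sp1", "sp2", "sp4"}, references)
    prof_y = phylogenetic_profile({"sp1", "sp4"}, references)
    print(prof_x, prof_y, round(profile_similarity(prof_x, prof_y), 3))
```

Protein pairs whose profiles are highly similar are the ones such a clue would flag as candidate interactors.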

As shown in Supplementary Figs 3 and 4, SM yields comparable 
performance to other clues over the entire range of false positive rate 
(FPR) values but is considerably more effective at low FPRs (for 
example, FPR = 0.1%). This is critical because, owing to the huge number 
of negative interactions, only very low FPRs can produce a small 
enough number of false positives to be used effectively in practice. At 

MA and MB, on their corresponding structural neighbours, NA₁ and NB₃. We 
assign five empirical-structure-based scores to each interaction model 
(Supplementary Fig. 1) and then calculate a likelihood for each model to 
represent a true interaction by combining these scores using a Bayesian 
network (Supplementary Fig. 2) trained on the HC and the N interaction 
reference sets. We finally combine the structure-derived score (SM) with non- 
structural evidence associated with the query proteins (for example, co- 
expression, functional similarity) using a naive Bayesian classifier. 


low FPRs, SM by itself outperforms even the naive Bayesian classifiers 
that combine all non-structure-based clues (NS). Looking specifically 
at the thousands of high-confidence SM predictions in the LC and the 
N sets with an LR score >600 (a value used in ref. 25 and correspond- 
ing in our study to a FPR of ~0.1%; see Methods), about 70% and 50%, 
respectively, share GO biological terms at, or more specific than, the 
sixth level of the GO hierarchy, suggesting that many of these inter- 
actions are real (Supplementary Fig. 5). 
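
The importance of very low false positive rates can be seen from simple arithmetic on the size of the search space. The Python sketch below uses the figure of about 6,500 yeast proteins quoted earlier and treats every possible pair as a negative, which is a simplifying assumption (true interactions are a small fraction of all pairs); the function names are hypothetical.

```python
def candidate_pairs(n_proteins: int) -> int:
    """Number of unordered protein pairs among n proteins."""
    return n_proteins * (n_proteins - 1) // 2


def expected_false_positives(n_negative_pairs: int, fpr: float) -> float:
    """Expected number of negatives wrongly called positive at a given FPR."""
    return n_negative_pairs * fpr


if __name__ == "__main__":
    pairs = candidate_pairs(6500)  # ~6,500 yeast proteins, as quoted in the text
    for fpr in (0.05, 0.01, 0.001):
        fp = expected_false_positives(pairs, fpr)
        print(f"FPR {fpr:.1%}: about {fp:,.0f} false positives out of {pairs:,} pairs")
```

Under these assumptions there are about 21 million candidate pairs, so even an FPR of 0.1% corresponds to roughly 21,000 false positives, comparable in size to the high-confidence reference sets themselves.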

As mentioned earlier, PrePPI combines structural and non-structural 
clues using a naive Bayesian network. Supplementary Fig. 4 shows 
that the performance of PrePPI is superior to that obtained from struc- 
tural and non-structural evidence alone, implying that the two sources of 
information are largely complementary. This point can be clearly seen in 
the Venn diagrams of high-confidence (LR >600) predictions shown in 
Supplementary Fig. 6. It is evident from the figure that combining struc- 
tural and non-structural clues yields many more high-confidence pre- 
dictions and identifies more interactions in the HC set than either source 
of information alone. As an independent test of PrePPI, we assessed its 
performance against one of the challenges in the 2009 Dialogue for 
Reverse Engineering Assessments and Methods (DREAM) workshop 
specifically aimed at PPI predictions. As discussed in Supplementary 
Table 3, PrePPI outperformed all other methods for cases where struc- 
tural information is available. 
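
In a naive Bayesian classifier, evidence sources are treated as conditionally independent, so their likelihood ratios multiply. The Python sketch below shows this combination for one candidate interaction. The individual LR values and the prior odds are invented for illustration and are not taken from PrePPI; only the cutoff of 600 comes from the text, and the function names are hypothetical.

```python
from math import prod


def combined_lr(clue_lrs):
    """Naive Bayes combination: multiply the likelihood ratios of all clues."""
    return prod(clue_lrs.values())


def posterior_odds(prior_odds: float, lr: float) -> float:
    """Posterior odds of a true interaction = prior odds x likelihood ratio."""
    return prior_odds * lr


if __name__ == "__main__":
    # Invented likelihood ratios for one candidate pair, for illustration only.
    clues = {"SM": 50.0, "GO": 4.0, "coexpression": 3.0}
    lr = combined_lr(clues)
    print("combined LR:", lr)
    print("passes LR cutoff of 600:", lr >= 600)
    # Hypothetical prior odds of 1 true interaction per 1,000 candidate pairs.
    print("posterior odds:", posterior_odds(1 / 1000, lr))
```

The multiplication makes clear why complementary clues help: moderate structural and non-structural evidence, neither decisive alone, can jointly push a pair above a high-confidence cutoff.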

We compared the performance of PrePPI to that of high-throughput 
experiments (Supplementary Table 4) using data provided in a detailed 
comparison of different high-throughput techniques reported 
previously. We used the data sets in ref. 23 to define true positives, and 
compiled a new negative reference set that consists of protein pairs in 
which each protein is annotated as localized to a different cellular com- 
partment (see Supplementary Fig. 7 and Methods). This was essential 
for comparison to experimental assays because, as constructed, our N 
set excludes data compiled from high-throughput experiments, and 
hence the FPR for experimental assays is artificially zero (see also 
related discussion in supplementary information in ref. 23). 

As can be seen in the receiver operating characteristic (ROC) curves 
reported in Fig. 2a and Supplementary Fig. 8, PrePPI performance is 
generally comparable to, although somewhat better overall than, that of 
high-throughput methods for most data sets that were tested. Figure 2b 
shows a Venn diagram in which the PrePPI data set is based on an 
LR cutoff of 600 (FPR ~ 0.1%). Results for other LRs and additional 
reference sets are shown in Supplementary Fig. 9. As can be seen, many 
of the interactions inferred by PrePPI are different from those identified 

Figure 2 | ROC curve and Venn diagram for PrePPI predictions and 
high-throughput experiments in yeast. a, ROC curve. TPR, true positive rate. 

b, Venn diagram. The CCSB-PRS positive reference interaction set is defined in 
ref. 23 and described in Methods. High-throughput experiments are labelled 
with the first author of the relevant publication (Supplementary Table 4). The 
number of interactions in each set is given after the set label in the Venn 
diagram. 


by high-throughput assays. Methods that combine both approaches 
may thus prove to be highly effective in expanding the coverage of PPIs. 

At an LR cutoff of 600, PrePPI predicts 31,402 high-confidence 
interactions for yeast and 317,813 interactions for human. These, as 
well as predictions with lower LR scores, are available in a database 
from the PrePPI website (http://bhapp.c2b2.columbia.edu/PrePPI/). 
As a further validation of PrePPI we tested its performance on the 
approximately 24,000 new interactions involving human proteins that 
were added to public databases after August 2010 (Supplementary 
Table 5). Among these interactions, 1,644 are predicted by PrePPI to 
have an LR >600 (based on a Bayesian classifier derived from pre-2009 
data on yeast), so that they essentially correspond to experimental 
validation of true predictions. 

Specific experimental validation of 19 individual PrePPI predictions, 
using co-immunoprecipitation assays, was carried out in four separate 
laboratories, leading to confirmation of 15 of these interactions 
(Supplementary Figs 10-14 and Supplementary Table 6). Specifically, 
the investigators in each laboratory queried the PrePPI database for 
previously uncharacterized interactions involving proteins of interest 
and that, as much as possible, had relatively high SM and PrePPI scores 
(see Supplementary Table 6 for more information). 

One set of predictions involves potential PPIs formed between the 
nuclear receptor peroxisome proliferator-activated receptor γ (PPAR-γ) 
and other transcription factors. PPAR-γ has a pivotal role in regulating 
glucose and lipid metabolism, the inflammatory response and 
tumorigenesis, and is known to heterodimerize with retinoid X receptors 
(RXRs) and to recruit cofactors to regulate target gene transcription. 
PrePPI predicts high-confidence interactions between PPAR-γ and 
the transcription factors LXR-β (also known as NR1H2), PAX7, 
PDX1, NKX2-2 and HHEX (Supplementary Table 6). Except for 
HHEX, all of the interactions were validated (Supplementary Fig. 10). 
The predicted interaction with nuclear receptor LXR-β might have been 
expected based on the ability of these proteins to heterodimerize through 
their ligand-binding domains. Nevertheless, this specific interaction had 
not previously been characterized and suggests a so far unrecognized 
convergence of signalling and metabolic pathways regulated by these 
two nuclear receptors. The interactions between the ligand-binding 
domain of PPAR-γ and the homeodomains of PAX7, PDX1 and 
NKX2-2 are new observations that require further studies, as they 
suggest that PPAR-γ may have a role in endocrine progenitor and 
pancreatic β-cell development. 

A second set of examples involves suppressor of cytokine signalling 
(SOCS3), an SH2-domain-containing protein that negatively regulates 
cytokine-induced signal transduction. Until now, the mechanism of 
the inhibitory function of SOCS3 has been primarily established for its 
involvement in the JAK/STAT pathway. PrePPI predicts that SOCS3 
forms complexes with GRB2 and RAF1, two key components in the 
RAS/MAPK pathway, and these interactions were confirmed experi- 
mentally (Supplementary Fig. 11A, B). PrePPI also predicts the forma- 
tion of a complex between SOCS3 and BTK, a cytoplasmic tyrosine 
kinase important in B-lymphocyte development, differentiation and 
signalling, and this interaction was also validated (Supplementary 
Fig. 11C). The SOCS3-GRB2 interaction is predicted to be mediated 
by their SH2 domains, whereas the SOCS3 interaction with BTK is 
predicted to be mediated by an SH2-SH3 domain interaction. Analysis 
of the predicted binding preferences of SH2 domains as well as results 
on other protein families indicate that the PrePPI scoring function 
accounts, at least in part, for the binding preference of closely related 
protein domains (Supplementary Fig. 15, see also later). 

A third group of novel observations involves the identification 
of kinases that interact with the clustered protocadherin proteins 
(protocadherin α, β and γ (PCDH-α, -β and -γ)). The PCDHs have six 
cadherin-like extracellular domains, and unique cytoplasmic domains. 
They assemble into large complexes at the cell surface, and associate 
with a variety of proteins, including signalling adaptors, kinases and 
phosphatases. Analysis of potential PCDH-kinase PPIs confirmed 
published interactions of PCDH-α and -γ with the tyrosine 
kinase RET, and predicted interactions with ROR2, VEGFR2 and 
ABL1 (Supplementary Table 6 and Supplementary Fig. 12; experiments 
done in mice). PrePPI predicts that these PPIs are mediated by the 
extracellular cadherin domains and immunoglobulin (Ig) domains, a 
result that was confirmed experimentally (Supplementary Fig. 12A-D). 
A hydrophobic residue, Phe 64, of the ROR2 Ig domain is predicted to 
be in the centre of the interface it forms with PCDH-α4. Mutating this 
Phe to an Ala, a smaller hydrophobic residue, has no detectable effect on 
binding, whereas mutating it to charged residues considerably weakens 
the interaction (Supplementary Fig. 12B, C). These results suggest that, 
in addition to predicting binary interactions, PrePPI has the potential to 
reveal novel and unsuspected interfaces. 

The fourth group of experiments was carried out with the goal of 
identifying new components of large protein-protein complexes. We 
validated two previously uncharacterized interactions between the 
special AT-rich sequence-binding protein SATB2 and the Emerin 
‘proteome’ complex 32, and one involving the pre-mRNA-processing 
factor PRPF19 and the centromere chromatin complex (Supplementary 
Fig. 13). It is important to emphasize that each of the PPIs detected 
must be confirmed through appropriate in vivo experiments. Taken 
together, however, these findings suggest that PrePPI has sufficient 
accuracy and sensitivity to provide a wealth of novel hypotheses that 
can drive biological discovery. 

The accuracy and range of applicability of PrePPI, and the crucial 
role of structural modelling, were unanticipated, but should not come 
as a complete surprise. Most protein complexes in the PDB have 
structural neighbours that share binding properties, and protein 
interface space may well be close to ‘complete’ in terms of the packing 
orientations of secondary structure elements. Moreover, these elements 
can be identified with geometric alignment methods, a fact 
that has been exploited in the approach introduced here. Although the 
information required to predict whether two proteins interact appears 
to be present in the PDB, the question has been how to mine the data. 

Three key elements are responsible for the success of structural 
modelling and PrePPI. The first is the marked expansion in the num- 
ber of interactions that can be modelled, owing to the use of both 
homology models and remote structural relationships. About 8,600 
PDB structures but more than 31,000 models are found as represen- 
tatives of at least one domain of ~14,100 human proteins. If we had 
only used experimentally determined structures in our analysis, a total 
of only ~2.5 million human PPIs (versus 36 million when homology 
models are used) could have been modelled. Similarly, had we limited 
ourselves to structural neighbours taken from the same Structural 
Classification of Proteins (SCOP) fold, only ~225 thousand interactions 
could have been modelled, as opposed to 36 million. 

As might be expected, predictions based on structural modelling 
that use only PDB structures or close structural neighbours are more 
likely to recover known interactions (defined by their presence in 
databases) than those that only use homology models or remote struc- 
tural relationships (Supplementary Fig. 16). However, the latter, on 
their own, yield a marked expansion in the total number of interaction 
models and, consequently, many more high-confidence predictions 
and known interactions. Most importantly, in the calculation of the 
PrePPI score, the huge number of low-confidence structural inter- 
action models led to an even greater expansion in high-confidence 
predictions when combined with functional, evolutionary and other 
sources of evidence (Supplementary Fig. 16). 

The second key element in our strategy is the efficiency of our 
scoring scheme for interaction models, which allows us to evaluate 
an extremely large number of models while still discriminating among 
closely related family members. Discrimination among complexes 
involving members of the same protein family—that is, specificity— 
is obtained from the properties of the predicted interface, for example, 
the statistical propensity of certain amino acids to appear in interfaces 
(and, additionally, from non-structural clues; for example, whether 
the two proteins are co-expressed). As examples, our analysis of the 
SH2 and GTPase families shows that the structural modelling (and 
PrePPI scores) for these closely related proteins produce a wide range 
of LRs, with the higher LRs associated with a higher probability of 
being a known interaction (Supplementary Fig. 15). 

The third element responsible for the success of PrePPI is the 
Bayesian evidence integration method that allows independent and 
possibly weak interaction clues to be combined to make reliable pre- 
dictions and to improve prediction specificity (Supplementary Figs 15 
and 16). 

Figure 3 provides two examples of the use of remote structural 
relationships and homology models. In Fig. 3a, an HC set interaction 
of serine/threonine-protein kinase D1 (PRKD1) and protein kinase 
C-ε (PRKCE) is recovered by structural modelling using a complex of 
two proteins in the ubiquitin pathway (not kinases) as template. Note 
that PRKD1 and PRKCE are not sequence homologues of the two 
corresponding ubiquitin pathway proteins and are classified as belong- 
ing to different SCOP folds. However, the interaction model has a 
significant SM score (LR = 130) arising from both local structural 
similarity and a conserved interface. Figure 3b describes a prediction 
of an LC set interaction between the elongation factor 1-δ (EEF1D) 
and the von Hippel-Lindau tumour suppressor (VHL) using the same 
template as that used in Fig. 3a. Again, there is no sequence relationship 
between the target and the template proteins, and they are classified 
into different folds. Nevertheless, the interaction model has an LR of 
70. We note that EEF1D and VHL were found to interact using 
mass spectroscopy and by co-immunoprecipitation experiments 
reported here (Supplementary Fig. 14). 

The exploitation of homology models and of remote structural rela- 
tionships implies that each new structure that is determined experi- 
mentally can be used to detect large numbers of new functional 
relationships, even if the protein in question is of only limited bio- 
logical interest on its own. In this regard, our approach has benefitted 
from structural genomics initiatives, which produced a large increase 

Figure 3 | Models for the PPI formed between PRKD1 and PRKCE, and 
EEF1D and VHL using homology models and remote structural 
relationships. a, Model for PRKD1 and PRKCE. b, Model for EEF1D and 
VHL. The same template complex of ubiquitin-conjugating enzyme E2D 3 
(UBE2D3) and ubiquitin (PDB accession: 2FUH; A and B chain, shown in blue 
and red, respectively) was used in both cases. The structures of the PH domain 
of PRKD1 and the GNE domain of EEF1D (shown in green and purple) are 
homology models from ModBase; the structure of a C1 domain of PRKCE 
(yellow) is a homology model from SkyBase; the structure of VHL (cyan) is 
from PDB (accession 1LM8; V chain). In each case, the relevant homology 
models are structurally superimposed on one of the two templates in the 
UBE2D3-ubiquitin complex. 


in the coverage of sequence families that did not have structural repre- 
sentatives. We note that PrePPI appears in many cases to offer a 
viable alternative to high-throughput experiments yielding, in addition 
to a likelihood of a given interaction, a model (albeit a crude one) of the 
domains and residues that form the relevant protein-protein interface. 
This should in turn facilitate the generation of experimentally testable 
hypotheses as to the presence of a true physical interaction. In con- 
clusion, our study suggests the ability to add a structural ‘face’ for a 
large number of PPIs, and that structural biology can have an import- 
ant role in molecular systems biology. 


METHODS SUMMARY 


Details of the PrePPI algorithm, protein datasets, and experimental validations are 
available in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 

METHODS 


Proteins and domains. We obtained the yeast proteome from UniProt, and 
parsed its 6,521 proteins into 7,792 domains using the SMART online server. 
Similarly, for human, we identified 20,318 unique proteome members, producing 
49,851 individual domains. 

Structures. Structural representatives of the entire protein or different individual 
domains were either taken directly from the PDB, where available, or from the 
ModBase and SkyBase homology model databases. PDB structures were identified 
by sequence homology, using a single iteration of PSI-BLAST and an 
E-value cutoff of 0.0001; matching structures in the PDB were required to have 
>90% sequence identity and cover >80% of the query target (the entire protein 
or any domain). Homology models were selected based on two criteria: (1) an E 
value less than 1 X 10°; or (2) an E value less than 1 and either a structure-based 
pG score ≥0.3, for SkyBase models, or a ModPipe protein quality score 
(MPQS) ≥0.5, for ModBase models. When multiple structures were available 
for a target/domain we chose only one representative using: (1) the PDB structure 
with the best resolution, if available; (2) the ModBase model with the highest 
MPQS score; or (3) the SkyBase model with the highest pG score. On the basis 
of these criteria, we identified 1,361 PDB structures and 7,222 homology models 
for 4,193 different yeast proteins. Among these, 627 proteins could be matched to a 
PDB structure and 3,662 to a homology model, with some proteins having both. 
For human, 14,132 proteins were matched to 8,582 PDB structures and 30,912 
models. Specifically, 4,286 proteins were matched to a PDB structure and 11,266 
were matched to a homology model, with some proteins matched to both. 
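
The rule for choosing one representative structure per target can be sketched in a few lines of Python. This is only an illustration of the stated priority order (best-resolution PDB structure, then highest-MPQS ModBase model, then highest-pG SkyBase model); the `Candidate` class, its field names and the `pick_representative` function are hypothetical and are not taken from the PrePPI code.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Candidate:
    """One candidate structure for a protein or domain (hypothetical record)."""
    source: str                         # "PDB", "ModBase" or "SkyBase"
    identifier: str
    resolution: Optional[float] = None  # in angstroms, PDB only; lower is better
    mpqs: Optional[float] = None        # ModBase quality score; higher is better
    pg: Optional[float] = None          # SkyBase pG score; higher is better


def pick_representative(candidates: Sequence[Candidate]) -> Optional[Candidate]:
    """Apply the three-step priority described in the text."""
    # (1) Prefer an experimental structure, taking the best (lowest) resolution.
    pdb = [c for c in candidates if c.source == "PDB" and c.resolution is not None]
    if pdb:
        return min(pdb, key=lambda c: c.resolution)
    # (2) Otherwise take the ModBase model with the highest MPQS.
    modbase = [c for c in candidates if c.source == "ModBase" and c.mpqs is not None]
    if modbase:
        return max(modbase, key=lambda c: c.mpqs)
    # (3) Otherwise take the SkyBase model with the highest pG score.
    skybase = [c for c in candidates if c.source == "SkyBase" and c.pg is not None]
    if skybase:
        return max(skybase, key=lambda c: c.pg)
    return None
```

The sketch assumes that candidates have already passed the sequence-identity, coverage and E-value filters described above; it only encodes the final tie-breaking step.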
Structural neighbours. We used the structural alignment tool Ska to identify 
structural neighbours. Ska allows alignments to be considered significant even if only 
three secondary structural elements are well aligned. At a protein structure 
distance (PSD) cutoff of 0.6, we identified 1,448 neighbours (both close and 
remote) per structure for 7,875 structures of 3,911 yeast proteins, and 1,553 
neighbours per structure for 36,743 structures of 13,545 human proteins. 
Template complexes. As of February 2010, there were about 37,000 protein-protein 
complexes involving multiple organisms in the PDB and PQS databases. 
We used 28,408 and 29,012 complexes as templates during our modelling of yeast 
and human interactions, respectively. PQS terminated updates after August 2009, 
and has been replaced by the protein interfaces, surfaces and assemblies (PISA) 
server, which will be used in future work. 

Interaction modelling. Given a pair of proteins or domains, we built their inter- 
action model by superimposing their structures with the corresponding structural 
neighbours in the templates (Fig. 1). For yeast, we built 550 million models for 2.4 
million potential PPIs, and for human, we built 12 billion models for 36 million 
potential PPIs. We calculated five structure-based scores for each model (Sup- 
plementary Fig. 1) and used a Bayesian network to combine these scores into an LR 
to evaluate an interaction model (Supplementary Fig. 2) based on the HC and the 
N reference sets (Supplementary Table 1). 

Non-structural clues. For the yeast proteome, we downloaded the raw data for 
four different clues: protein essentiality (ES), co-expression (CE), GO similarity, 
and MIPS similarity, from the Gerstein laboratory 
(http://networks.gersteinlab.org/intint/supplementary.htm). We also implemented 
a measure of phylogenetic profile (PP) similarity based on a previously 
introduced measure (see later). We calculated an LR for each non-structural 
clue based on our HC and N reference sets. For the human proteome, we 
calculated three different clues following the protocol previously described 
for GO and CE, and as described later for PP. For CE, we used the expression 
data set (GDS1962), which is one of the most comprehensive microarray studies 
of 19,803 human genes under 180 different conditions, 
from the Gene Expression Omnibus. 

Phylogenetic profile similarity. Using a similar method to that previously 
described, we calculated a continuous score between 0 and 1 to measure the 
occurrence of a protein and/or domain in 1,156 reference organisms with complete 
proteome information from UniProt. These scores form a phylogenetic profile 
vector (PPV), and the Pearson correlation coefficient (PCC) was used to define the 
similarity between two vectors. For proteins with multiple domains, each domain’s 
PPV is calculated independently, and the highest PCC score of different domain 
pairs is selected as the similarity score between two proteins. Similarity scores for 
pairs of proteins/domains with >40% sequence identity and, of course, for homo- 
meric protein/domain pairs were not calculated. 
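
As a rough illustration of the similarity measure described above, the following Python sketch computes the Pearson correlation coefficient between two phylogenetic profile vectors and, for multi-domain proteins, takes the highest value over all domain pairs. The function names and the toy three-organism profiles are assumptions made for the example; the sketch also omits the sequence-identity and homomeric-pair exclusions mentioned in the text.

```python
import math
from typing import Optional, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> Optional[float]:
    """Pearson correlation coefficient of two equal-length profile vectors.

    Returns None when either vector is constant, because the coefficient
    is undefined in that case.
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("profiles must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return None
    return sxy / math.sqrt(sxx * syy)


def protein_pp_similarity(
    domains_a: Sequence[Sequence[float]],
    domains_b: Sequence[Sequence[float]],
) -> Optional[float]:
    """Highest domain-pair PCC, used as the protein-level similarity score."""
    scores = []
    for profile_a in domains_a:
        for profile_b in domains_b:
            score = pearson(profile_a, profile_b)
            if score is not None:
                scores.append(score)
    return max(scores) if scores else None
```

In real use each profile would have one entry per reference organism (1,156 in the text) rather than the three used for testing.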

The naive Bayes classifier. We combine the different types of clues with each 
other and structural modelling into a single naive Bayes PPI classifier: 


$$\mathrm{LR}(c_1, c_2, \ldots, c_n) = \prod_{i=1}^{n} \mathrm{LR}(c_i)$$
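
Under the naive Bayes assumption that the clues are conditionally independent, the combined likelihood ratio is simply the product of the per-clue likelihood ratios. A minimal Python sketch follows; the function name and the example values are illustrative assumptions, and only the LR cutoff of 600 comes from the text.

```python
import math
from typing import Iterable


def combined_lr(clue_lrs: Iterable[float]) -> float:
    """Multiply per-clue likelihood ratios into one naive Bayes LR."""
    lrs = list(clue_lrs)
    if any(lr < 0 for lr in lrs):
        raise ValueError("likelihood ratios must be non-negative")
    # math.prod of an empty list is 1.0, i.e. no evidence either way.
    return math.prod(lrs)


# Hypothetical example: a structural-modelling LR of 130 combined with two
# weak non-structural clues (LR 2 and LR 3) crosses the cutoff of 600.
HIGH_CONFIDENCE_CUTOFF = 600.0
example = combined_lr([130.0, 2.0, 3.0])
is_high_confidence = example > HIGH_CONFIDENCE_CUTOFF
```

The example shows why individually weak clues matter: none of the three values exceeds the cutoff alone, but their product does.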


Tenfold cross validation. We randomly divided the positive and negative reference 
sets into ten subsets of equal size. Each time, we used nine subsets to train 
the classifier, and obtained the LR for each protein pair, that is, interaction, in the 
excluded subset from the trained classifier. We repeated the procedure ten times 
using different subsets as training and testing data sets and finally obtained an LR 
for each interaction. We counted the number of true positives (predictions in the 
HC set) and false positives (predictions in the N set) and calculated the prediction 
TPR (true positive rate) = TP/(TP+FN), and the FPR (false positive rate) = FP/(FP+TN), to plot the 
ROC curves. In all cases, we have removed structural interaction models based on 
a template that corresponds to an actual crystal structure of the two target proteins. 
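
The bookkeeping behind the cross-validated ROC curve can be sketched as follows. This is an assumption-laden illustration, not the authors' code: `split_into_folds` and `roc_point` are hypothetical helper names, the classifier training step is left out, and a prediction is counted as positive when its held-out LR exceeds the chosen cutoff.

```python
import random
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_into_folds(items: Sequence[T], k: int = 10, seed: int = 0) -> List[List[T]]:
    """Randomly divide a reference set into k subsets of (near-)equal size."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    return [shuffled[i::k] for i in range(k)]


def roc_point(
    positive_lrs: Sequence[float],
    negative_lrs: Sequence[float],
    cutoff: float,
) -> Tuple[float, float]:
    """Return (TPR, FPR) at one LR cutoff.

    positive_lrs: held-out LRs for pairs in the positive (HC) reference set.
    negative_lrs: held-out LRs for pairs in the negative (N) reference set.
    """
    tp = sum(1 for lr in positive_lrs if lr > cutoff)
    fn = len(positive_lrs) - tp
    fp = sum(1 for lr in negative_lrs if lr > cutoff)
    tn = len(negative_lrs) - fp
    tpr = tp / (tp + fn) if positive_lrs else 0.0
    fpr = fp / (fp + tn) if negative_lrs else 0.0
    return tpr, fpr
```

Sweeping `cutoff` over the observed LR values and plotting the resulting (FPR, TPR) pairs gives the ROC curve.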
Comparison with high-throughput experiments. We retrieved eight high- 
throughput experiment data sets for yeast and three for human (Supplementary 
Table 4). In our comparison, in addition to the HC sets, we also use the same 
reference interaction sets used in the comparative study of different high-throughput 
techniques. These include ~1,300 PPIs (CCSB-BGS) and a subset of 188 highly 
reliable PPIs that are referenced in at least four manuscripts (CCSB-PRS). We 
compiled a new negative reference set, which consists of 440,000 yeast and 
1,750,000 human protein pairs in which each protein in a pair is annotated as 
localized to a different cellular compartment (Supplementary Fig. 7). 

New protein interaction data set. We used 23,779 human protein interactions 
newly deposited into databases after August 2010 as independent validations of 
PrePPI predictions, which were based on pre-2010 data (Supplementary Table 5). 
Co-immunoprecipitation in mammalian cells. Forty-eight hours after transfec- 
tion with indicated expression plasmids, HEK-293T cells were lysed in lysis buffer 
(20 mM HEPES, pH 7.9, 100 mM NaCl, 0.2 mM EDTA, 1.5 mM MgCl2, 10 mM 
KCl, 20% glycerol and 0.1% Triton-X100 for Supplementary Figs 10 and 11; 20 mM 
Tris-HCl, pH 7.5, 150 mM NaCl, 1 mM EDTA and 1% NP-40 for Supplementary 
Fig. 12; and 1× Cell Lysis Buffer (Cell Signaling) for Supplementary Fig. 13) 
supplemented with Protease Inhibitor Cocktail (Roche). Cell lysates were sonicated 
and pre-cleared with 30 µl of Protein G Sepharose (GE) before incubating with 15 µl 
anti-Flag M2 or 40 µl anti-HA Affinity Gel (Sigma-Aldrich) overnight at 4°C with 
shaking. Agarose beads were washed four times with lysis buffer. Lysates (input) 
and immunoprecipitates were denatured in reducing protein sample buffer, analysed 
by SDS-PAGE and immunoblotted with anti-Flag (Sigma-Aldrich), anti-HA 
(Roche), anti-PPAR-γ (Santa Cruz), anti-ABL1 (Santa Cruz), anti-ROR2 (Cell 
Signaling) or anti-VEGFR2 (Abcam) antibodies as indicated. 

Protein analysis from brain. Crude membrane fractions were prepared from 
brains of postnatal day (P)0 to P5 wild-type mice or Pedhg"”" mice provided 
by X. Wang. The brain tissues were homogenized in a buffer A (5 mM Tris-HCl, 
pH 7.4, 0.32 M sucrose, 1 mM EDTA, 50 mM dithiothreitol) supplemented with 
the Complete Protease Inhibitor Cocktail. The nuclei and insoluble debris were 
collected by a low-speed centrifugation at 1,000g for 10 min and subsequently the 
supernatant was collected by centrifugation at 22,000g for 30 min. The pellet was 
washed in the buffer A and solubilized in lysis buffer (Pierce). Crude membrane 
fraction (supernatant) was collected by centrifugation at 22,000g for 20 min. 

Clonal allelic predetermination of 
immunoglobulin-κ rearrangement 


Marganit Farago, Chaggai Rosenbluh, Maya Tevlin, Shira Fraenkel, Sharon Schlesinger, Hagit Masika, Masha Gouzman, 
Grace Teng, David Schatz, Yoach Rais, Jacob H. Hanna, Alexander Mildner, Steffen Jung, Gustavo Mostoslavsky, 
Howard Cedar & Yehudit Bergman 


Although most genes are expressed biallelically, a number of key 
genomic sites—including immune and olfactory receptor regions— 
are controlled monoallelically in a stochastic manner, with some 
cells expressing the maternal allele and others the paternal allele in 
the target tissue. Very little is known about how this phenomenon 
is regulated and programmed during development. Here, using 
mouse immunoglobulin-κ (Igκ) as a model system, we demonstrate 
that although individual haematopoietic stem cells are charac- 
terized by allelic plasticity, early lymphoid lineage cells become 
committed to the choice of a single allele, and this decision is then 
stably maintained in a clonal manner that predetermines monoallelic 
rearrangement in B cells. This is accompanied at the molecular level 
by underlying allelic changes in asynchronous replication timing 
patterns at the κ locus. These experiments may serve to define a 
new concept of stem cell plasticity. 

The κ locus undergoes monoallelic rearrangement in B cells. To study 
the developmental programming of this allelic exclusion event, we used 
mice carrying a normal murine constant region (mCκ) on one allele and 
a human constant region (hCκ) on the other, or Mus musculus/Mus 
castaneus hybrid mice (C57BL/6/M. castaneus (B6/Cast)), and isolated 
interleukin-7 (IL-7)-dependent pre-B cell clones (Supplementary 
Figs 1 and 2) from fetal liver. Although pre-B cells themselves have not 
yet undergone rearrangement at this locus, a number of different studies 
have suggested that they may already be programmed to direct the 
process of κ rearrangement that occurs after differentiation. To test 
this hypothesis, we transferred these single-cell pre-B clones to medium 
lacking IL-7, thereby inducing them to undergo rearrangement 
(Supplementary Fig. 2c), as occurs in normal B cells. Using specific 
mCκ and hCκ polymerase chain reaction (PCR) primers (Supplementary 
Table 1) to pick up the RNA products of rearranged loci, we were 
able to measure independently the extent of recombination for each 
allele (Fig. 1a). Notably, even though the original pool of pre-B cells was 
capable of recombining both alleles equally, individual single-cell 
clones were found to undergo rearrangement almost exclusively on a 
single allele. Clone 7, for example, was mainly able to rearrange the 
hCκ allele, whereas clone 10 had the alternative phenotype, with all 
rearrangements apparently occurring on the mCκ allele. 

To test whether this is also true under normal conditions in vivo, we 
used the same hCκ/mCκ pre-B-cell clones to reconstitute the B-cell 
compartment of severe combined immunodeficiency (SCID) mice, 
and then isolated B220+ splenic cells 3 weeks after the original injection. 
In mice reconstituted with clone 7, we found that almost 100% of 
the rearrangement events occurred on the hCκ allele. In contrast, 
in vivo B cells derived from clone 10 predominantly expressed mCκ 
(Fig. 1b), and similar allele-specific results were also obtained using 
pre-B-cell clones from B6/Cast hybrid mice for reconstitution (Fig. 1c). 


These experiments show that allelic choice is clonally maintained in 
pre-B cells, and that this ultimately restricts the selection of alleles for 
recombination in all derivative B cells. 

We next attempted to understand the molecular basis for this phenomenon 
by examining chromatin structure. Allele-specific chromatin 
immunoprecipitation (ChIP) analysis on pre-B-cell clone 7 indicated 
that acetylated histone 3 (H3ac) is preferentially enriched on the hCκ 
allele, consistent with our findings that this same allele is the one preferred 
for rearrangement after differentiation. Reciprocally, it is the 
alternative mCκ allele that is highly enriched in clone 10 (Fig. 2a). To 
explore allelic differences at other sites along the κ locus, we generated 
additional pre-B-cell clones from B6/Cast hybrid mice, and these 
revealed a similar allelic pattern of histone acetylation. Furthermore, 
analysis of these cells demonstrated that the same open allele is also 
marked with trimethylated histone 3 lysine 4 (H3K4me3) over the Jκ 
domain (Fig. 2b, c, Supplementary Fig. 3 and Supplementary Table 2). 


[Figure 1 image: panels a-c. Lane labels: Pool, Cl. 7, Cl. 10 (mCκ, hCκ); Mouse 1-5 (BccI −/+); Cl. E93, Cl. G101, Pool.] 


Figure 1 | Igκ allelic choice for rearrangement is predetermined and 
clonally inherited in pre-B cells. a, Igκ rearrangement was analysed in two 
mCκ/hCκ (ref. 3) single-cell-derived clones (clones (Cl.) 7 and 10) induced to 
undergo differentiation in medium lacking IL-7 for 72 h. PCR with reverse 
transcription (RT-PCR) analysis was performed using a Vκ degenerate 
forward primer and a reverse primer, specific either for hCκ or mCκ. A 
pre-B-cell pool was used as a control. b, c, Cells (10’) from hCκ/mCκ (b) or B6/Cast 
(c) single-cell-derived pre-B-cell clones were used to reconstitute the B-cell 
compartment of SCID mice. As a control, SCID mice were injected with pools 
of pre-B cells. RT-PCR was performed on RNA derived from B220+ splenic 
cells 3-4 weeks after reconstitution. The allelic nature of each 
rearrangement product was analysed using either allele-specific primers 
(hCκ/mCκ) or an allele-specific restriction site, BccI, which only recognized the Cast 
allele (B6/Cast). 

Figure 2 | Igκ-allele-specific ChIP analyses in pre-B cells. a, ChIP analysis 
with anti-H3ac on chromatin derived from pre-B-cell clones 7 and 10 
(hCκ/mCκ). Precipitated DNA was amplified through semi-quantitative PCR using 
primers specific for either hCκ or mCκ. Data from input and bound fractions 
were obtained using two concentrations of DNA (onefold and threefold). 
b, Schematic view of the κ locus in pre-B-cell clones from B6/Cast F1 mice. 
Allele-specific restriction sites generated by single-nucleotide polymorphisms 
(SNPs) between the two alleles were used to distinguish between them; HhaI 
and AhdI recognize only the B6 allele, whereas BccI recognizes the Cast allele. 
c, ChIP analysis on chromatin derived from a B6/Cast single-cell-derived pre-B 
clone (clone 8) using antibodies to H3ac or H3K4me3. Preferential enrichment 
(as indicated) was determined by digesting the chromatin input and bound 
fractions with HhaI or BccI (shown in b) or by direct sequencing. 
Quantification of results is shown in the histograms. The bars of the histograms 
represent measurements of gel densitometry (left) or direct sequencing (20 
molecules of the input and bound fractions) (right). d, Preferential 
rearrangement of one allele after removal of IL-7 from the medium (60 
molecules were sequenced). e, f, Preferential enrichment of Rag2 or Pax5 in 
clone 8 (e) and E93 (f) was determined by amplifying the 3′ Jκ region followed 
by digesting the input and bound chromatin fractions with the indicated 
restriction enzymes (e, left columns, and f), or by sequencing (e, right columns, 
20 molecules were sequenced from the input and bound fractions). 
Quantification of results is shown in the histograms. 


In all cases, the acetylated allele seems to be the preferred target for 
rearrangement (Fig. 2d). 

Recombination itself is carried out by the proteins Rag1 and Rag2, and 
previous results demonstrated that Rag2 is already bound to the Jκ region 
in pre-B cells. With this in mind, we did a ChIP analysis on chromatin 
from individual clones and used either restriction enzymes or sequence 
analysis to distinguish between the two alleles. Notably, we found that 
Rag2 always binds preferentially to the acetylated allele (Fig. 2e, f and 
Supplementary Fig. 4), perhaps because of its inherent ability to specifically 
identify nucleosomes labelled with H3K4me3 (refs 8, 9). In keeping 
with this idea, the same open allele was also preferentially marked with 
Pax5 (Fig. 2e, f), a protein that binds to well-characterized cis-acting elements 
known to be required for demethylation and rearrangement. 
Taken together, these experiments demonstrate that the κ alleles are 
clonally maintained in a fixed differential configuration in pre-B cells, 
with one allele specifically opened and primed for rearrangement. 

Because these chromatin marks seem to be maintained differentially 
on essentially identical alleles over many cell generations, it appears 
that there must be some basic underlying feature that directs this process. 
DNA replication timing is an important domain-wide epigenetic 
mark that is correlated with gene expression. Early replicating DNA is 
closely linked with gene transcription, whereas late replication is associated 
with gene repression. Furthermore, many monoallelically expressed 
genes are known to replicate asynchronously. It has been proposed 
that this property can be clonally transmitted and thus serve as a form 
of epigenetic memory in proliferating somatic cells. 

Even though the rearranged allele in B cells is always early replic- 
ating’, there is as yet no direct proof that the presence of this mark in 
pre-B cells can predict which allele will be the first to undergo recom- 
bination after differentiation. To address this question, we carried out 
replication time analysis on a number of individual pre-B-cell clones by 
pulsing with bromodeoxyuridine (BrdU), fractionating S-phase cells, 
and then isolating the BrdU DNA for gene-specific real-time PCR* 
(Fig. 3). In clone 7, we found that the hCκ allele replicates earlier than
the mouse allele, whereas in clone 10 it is the mCκ allele that replicates
early. These results show that there is a one-to-one correlation between 
allele-specific early replication, histone acetylation, and the predisposi- 
tion to undergo rearrangement after differentiation to B cells. 

It has been demonstrated that replication timing itself strongly 
influences the repackaging of chromatin after DNA synthesis, with 
deacetylated histones being assembled onto DNA that replicates in 
late S phase, whereas early replicating DNA is differentially repackaged 
with nucleosomes containing acetylated histones'*’*. It is thus not 
difficult to imagine how asynchronous replication itself may cause 
the two κ alleles to be in different configurations within the same cell.
Furthermore, as domain-wide replication timing is itself controlled by 
histone acetylation at origin modules”, this same mechanism may also 
explain how differential replication can be autonomously maintained 
over many cell generations’®. 

The κ locus replicates asynchronously in embryonic stem (ES) cells,
and this same pattern is also observed in a wide range of different
somatic cell types*'*. Because in some pre-B-cell clones the paternal
κ allele replicates early whereas in others it replicates late, it is clear that
there must be a stage in development at which these patterns become
clonally fixed. To trace this process, we first analysed the κ locus for
clonal asynchronous replication in ES cells. This was accomplished by 
growing cells carrying the Rosa26 marker on one allele of chromosome 
6 and using fluorescence in situ hybridization (FISH) probes to identify 
this locus'’. A polyclonal population of ES cells shows that in 50% 
of nuclei the early κ allele is associated with the Rosa26 marker,
whereas the other 50% show the same allele replicating late. Notably, 

Figure 3 | Asynchronous Igκ replication. Replication timing was analysed by
cell-cycle fractionation of BrdU-labelled clones 7 and 10 (see Methods). The κ
locus was amplified by PCR using specific primers for hCκ (black) or mCκ
(grey). PCR analysis of a control region (mouse globin) showed a completely
synchronous replication pattern (data not shown). Graphs represent
quantification of gel panels shown below.




all single-cell-derived ES subclones show this same bimodal pattern 
(Supplementary Table 3 and Supplementary Fig. 5), indicating that 
although the κ locus always replicates asynchronously in these cells,
the replication-time mode for each allele is not maintained in a clonal 
manner, as opposed to fibroblasts (Supplementary Table 3). This same 
form of allelic plasticity in ES cells has also been observed for the X 
chromosome”’, as well as other monoallelic loci?!. 

On the basis of this idea, we next tried to determine when during 
B-cell development the clonal choice of allele is established at the 
functional level. To this end, early haematopoietic stem cells (HSCs) 
from the hCκ/mCκ hybrid mice were isolated, infected with a lentiviral
vector that integrates stochastically at numerous sites in the genome”, 




Figure 4 | Establishment of Igκ allelic choice. a, HSCs were isolated from
hCκ/mCκ mouse bone marrow and infected with lentiviral vector expressing
GFP. SCID mice were reconstituted with GFP-expressing cells, spleens were
extracted 2-3 months later, and hCκ- and mCκ-expressing B cells were
separated by FACS. DNA was digested with BamHI and subjected to Southern
blot analyses using a fragment recognizing the WPRE element in the lentiviral
construct as a radioactive probe. b, SCID mice were reconstituted with
DS-Red-infected HSCs extracted from GFP+/− female mice in which the GFP construct
has been inserted into the X chromosome. Splenic B cells were sorted for GFP+
and GFP− cells and viral integration sites were analysed as in a. c, d, MPPs (c) or
CLPs (d) were isolated from the bone marrow of hCκ/mCκ mice and treated as
in a. SCID mice were reconstituted with GFP-expressing cells and FACS-sorted
splenic hCκ- and mCκ-expressing B cells were analysed for viral integration
using linear-amplification-mediated PCR (LAM-PCR)”. c, d, Integration
identical in hCκ- and mCκ-expressing cells is indicated by black arrows;
integration specific for mCκ is indicated by green arrows; and integration
specific for hCκ is indicated by blue arrows.
and then used to reconstitute the immune system in SCID mice.
After 3 months, B cells were isolated from the spleen and subjected 
to fluorescence-activated cell sorting (FACS) to separate cells that 
express hCκ from those that express mCκ. DNA was isolated from
these two cell fractions, and subjected to Southern blot analysis to 
identify viral integration sites (Fig. 4a and Supplementary Fig. 6a). This 
analysis showed that hCκ and mCκ cells have identical integration-site
patterns, indicating that individual HSCs do not have a fixed predetermined
allelic pattern at the κ locus, but rather seem to maintain the
potential to choose either allele.

As an internal control for these experiments, we reconstituted SCID 
mice with virally infected HSCs from female animals containing a GFP 
expression marker on one of their X chromosomes” and then isolated 
splenic B cells. Because this reporter gene is on the X chromosome, we 
could separate out GFP+ and GFP− B-cell populations. Both fractions
showed distinct viral integration patterns (Fig. 4b and Supplementary 
Fig. 6b). Thus, whereas the allelic choice of X chromosome is already set 
up in adult HSCs, presumably as a result of a commitment made during 
early development, the κ locus retains plasticity in these same cells.

We next carried out similar tracing experiments on multipotent 
progenitors (MPPs) and, in this case also, individual cells were able 
to choose either allele (Fig. 4c and Supplementary Fig. 6c). In contrast, 
hCκ and mCκ cell populations derived from common lymphoid precursors
(CLPs) showed a mixed pattern, with some integration bands
coming from cells that were already committed to one particular κ
allele, whereas other bands were seen in both hCκ and mCκ expressers,
and were thus representative of cells that had not yet made a commitment
(Fig. 4d and Supplementary Fig. 6d). As expected, pro-B cells had
already chosen a fixed allele (Supplementary Fig. 7), consistent with 
our observations that histone acetylation, H3K4 methylation and Rag2 
allelic binding are already set up in a clonal manner at this stage of 
development (Supplementary Fig. 8). 
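
The logic of these lineage-tracing comparisons can be sketched as a simple set comparison. In the minimal illustration below, each sorted B-cell population is represented by the set of viral integration bands detected in it. The band labels and the function name are hypothetical placeholders introduced for illustration; they are not measured values or part of the analysis reported here.

```python
def classify_integrations(hck_bands, mck_bands):
    """Split viral integration bands into shared and allele-specific groups.

    Each argument is a set of band identifiers detected in one sorted
    B-cell population (hCk-expressing or mCk-expressing).
    """
    return {
        # Seen in both populations: the infected progenitor gave rise to
        # B cells using either allele, so it had not yet committed.
        "shared": sorted(hck_bands & mck_bands),
        # Seen in only one population: the progeny used a single allele.
        "hCk_only": sorted(hck_bands - mck_bands),
        "mCk_only": sorted(mck_bands - hck_bands),
    }


# Hypothetical band labels, for illustration only.
hsc_like = classify_integrations({"b1", "b2", "b3"}, {"b1", "b2", "b3"})
clp_like = classify_integrations({"b1", "b2", "b4"}, {"b1", "b3", "b5"})
print(hsc_like)
print(clp_like)
```

In this sketch an entirely shared pattern corresponds to the HSC and MPP result, whereas a mixture of shared and allele-specific bands corresponds to the CLP result.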

We next asked whether the transition from plasticity to commit- 
ment at the functional level is also associated with underlying changes 
at the epigenetic level. To this end, we isolated HSCs, MPPs and CLPs 
from mice carrying a large deletion on one allele of the Tcrb gene”, and 
cultured them to generate single-cell subclones. Using double-label 
FISH on interphase nuclei, we analysed asynchronous replication 
timing with a κ-specific probe, and allele specificity by probing the
Tcrb locus in these same cells. In HSCs and MPPs, the κ locus was
found to replicate asynchronously, with either allele being the early one 
in each subclone (Table 1 and Supplementary Fig. 9), indicating that 
allele specificity is constantly switching in cells at these early stages of 


Table 1 | Asynchronous replication clonality during haematopoiesis.

                 Cκ
Clone      Paternal  Maternal
HSC B10    25        23
HSC C9     23        21
HSC F10    17        19
HSC H5     23        22
HSC H10    24        22
MPP B6     25        27
MPP E8     26        23
MPP F2     24        22
MPP G1     21        21
MPP H11    23        25
CLP B5     21        3
CLP C3     10        8
CLP E6     6         24
CLP E11    32        4
CLP G6     32        z
CLP H6     33        7
CLP H10    22        19

HSC, MPP and CLP single-cell clones originating from mice carrying a large deletion on the Tcrb
paternal allele gene were assayed by FISH for asynchronous DNA replication by counting the number of
Cκ single/doubles, using the Cκ and Tcrb probes. The number of paternal early or maternal early nuclei
was determined by reference to a Tcrb probe.
haematopoiesis. Although several CLP subclones showed this same
behaviour (CLP C3 and H10), others revealed a clonal pattern with 
either the paternal or maternal allele replicating early in every cell 
(Table 1). These clones have made a molecular commitment to one 
particular allele. Interestingly, FISH analysis of additional asynchron- 
ous markers on the same chromosome showed a pattern parallel to 
that of the κ locus (Supplementary Table 4), suggesting that both
plasticity and selection of alleles operate on a chromosome-wide level.
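
The distinction drawn in Table 1 between switching and clonally fixed subclones can be made explicit with a count-based check. The sketch below is an illustration only: the exact two-sided binomial test against a 50:50 expectation, the 0.05 cut-off and the function names are assumptions introduced here, not the analysis used in this study. The counts are taken from Table 1.

```python
from math import comb


def binomial_two_sided_p(k, n):
    """Exact two-sided binomial p-value for k successes in n trials at p = 0.5."""
    # Under p = 0.5 every outcome i has probability C(n, i) / 2**n.
    observed = comb(n, k)
    # Sum all outcomes that are no more likely than the observed one.
    tail = sum(comb(n, i) for i in range(n + 1) if comb(n, i) <= observed)
    return tail / 2 ** n


def classify(paternal_early, maternal_early, alpha=0.05):
    """Label a subclone from its paternal-early and maternal-early nuclei counts."""
    n = paternal_early + maternal_early
    if binomial_two_sided_p(paternal_early, n) >= alpha:
        # No detectable bias: compatible with alleles switching between divisions.
        return "switching"
    return "paternal early" if paternal_early > maternal_early else "maternal early"


# (paternal early, maternal early) counts from Table 1.
counts = {
    "HSC B10": (25, 23),
    "MPP G1": (21, 21),
    "CLP C3": (10, 8),
    "CLP E6": (6, 24),
    "CLP H6": (33, 7),
}
for clone, (paternal, maternal) in counts.items():
    print(clone, classify(paternal, maternal))
```

A non-significant result does not prove that a clone is switching; it only shows that the counts are compatible with an even split, which is why the threshold should be treated as a convenience rather than a definition of commitment.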

The establishment of random asynchronous replication seems to take 
place in three steps during normal development. In the pre-implantation 
embryo, the patterns present in the gametes are erased so that these loci 
replicate synchronously. Asynchronous replication in the developing 
organism is only initiated at about the time of implantation*’*, and 
experiments in ES cells suggest that these patterns are not clonally main- 
tained at this stage (Supplementary Table 3). Rather, these cells must be 
constantly switching their alleles between early and late replication after 
each cell division. Finally, allele-specific replication becomes clonally 
fixed in differentiated somatic cells, generating a mixed population 
containing some clones that always choose the maternal allele to rep- 
licate early, with others replicating the paternal allele early (Table 1). 
This occurs as a chromosome-wide primary event, independent of 
allele-specific expression*’*** (Supplementary Table 4). 

This transition from the switching state to the clonally inherited state 
has never been studied at the functional level, to our knowledge. Our 
results on the κ locus show that this domain replicates in a switchable
manner in HSCs, but becomes clonally fixed at the time of lymphoid 
commitment. Notably, we have shown that the pattern of replication 
can predict the choice of allele for rearrangement, both in pro-B- and 
pre-B-cell clones. Before this, HSCs and MPPs still retain the ability 
to generate clonal populations that can choose either the maternal or
paternal allele for rearrangement. This suggests that these adult stem 
cells not only have the potential to differentiate into multiple lineages, 
but also to maintain the plasticity required to switch between alleles. It 
thus appears that HSCs themselves are derived from precursor popula- 
tions tracing back to the original ES cell, and that they probably retain 
allelic plasticity throughout this process. These results indicate that the 
ability to generate allele diversity is still present in adult stem cells, 
explaining how the entire B lineage compartment can be generated from 
a single stem cell’’. Therefore, it seems that allelic plasticity is a common 
feature of stem cells regardless of their developmental potency. 

Taken together, our experiments indicate that chromatin has a cis- 
acting role in organizing stable regulatory signals on the DNA that 
can direct subsequent allelic choice during B-lineage differentiation. 
This structural organization is consistent with the overall logic of 
haematopoiesis. As opposed to the developmental kinetics of stem
cells involved in organ formation during embryogenesis, the dynamic 
generation of blood cells in the adult seems to be designed in a manner 
that allows each sub-branch of the system to operate autonomously to 
enable a steady supply of each cell type according to its own needs. It 
may be for this reason that the network is built with so many different 
semi-stable intermediate stages programmed for specific tasks, and it is 
likely that epigenetic pre-marking is what makes this entire system 
possible’*. Our studies fit into this plan by showing that allelic choice is 
also programmed in a similar manner. 


METHODS SUMMARY 


Pro-B- and pre-B-cell clones were prepared from fetal liver harvested at 11.5-13.5
days post-coitum (d.p.c.) from hybrid hCκ/mCκ or B6/Cast mice. Cells were grown
on an ST2 feeder layer in medium containing IL-7 (ref. 6). After 10-14 days of
IL-7-mediated positive selection, cells were plated on 96-well plates in limiting dilutions
to generate single-cell-derived pre-B- or pro-B-cell clones. ChIP was carried out as
previously described*”. IL-7-dependent pre-B-cell clones were analysed for their
replication timing by S-phase fractionation, as previously described*. HSCs, MPPs
and CLPs isolated from LN3 (Vβ1NT/+) mice” were FACS sorted (using cell surface
markers) and single-cell cloned in 96-well plates by limiting dilution. FISH analysis
was performed as previously described’. FACS-sorted pro-B cells, CLPs, MPPs and
HSCs were purified from bone marrow of hCκ/mCκ mice and were infected with
lentiviral vectors expressing GFP’. B cells were FACS sorted either for the expression
of hCκ or mCκ, or for GFP+ and GFP− subpopulations. Viral insertion site
patterns were detected either by Southern blot analysis or by LAM-PCR, as
previously described”*.


Full Methods and any associated references are available in the online version of 
the paper. 
METHODS

Animals. Mice carrying a human constant region (hCκ) (gift from M. Nussenzweig)
were crossed with BALB/c wild-type mice, yielding the hCκ/mCκ hybrid. C57BL/6
(B6) mice were crossed with wild-type M. castaneus (Cast) mice (Jackson
Laboratories) to generate the B6/Cast hybrid. Rag1−/− mice (Jackson Laboratories) were
crossed with hCκ mice to generate Rag1−/− hCκ/hCκ homozygotes, which were
further crossed with Rag1−/− mice to generate the Rag1−/− hCκ/mCκ genotype.
LN3 (Vβ1NT/+) mice were re-derived from the appropriate ES cells containing the
preassembled Vβ1Dβ1Jβ1.4Cβ1 gene by tetraploid embryo complementation, as
previously described***’. Cells from these mice harbour a wild-type Tcrb allele as
well as a preassembled allele that contains a large deletion resulting from the
rearrangement process. Mice were housed and cared for under specific
pathogen-free conditions, and all animal procedures were approved by the Animal
Care and Use Committee of the Hebrew University of Jerusalem.

Cells, cultures and flow cytometry. To prepare pro-B- and pre-B-cell clones, fetal
liver cells from the embryos described above were harvested at 11.5-13.5 days
post-coitum (d.p.c.) and grown on an ST2 feeder layer (provided by A. Rolink) in
RPMI 1640 medium supplemented with 10% FCS, 2 mM L-glutamine, 100 U ml−1
of penicillin, 100 µg ml−1 streptomycin and 50 µM 2-mercaptoethanol in the
presence of IL-7 (conditioned medium collected from J558-IL7 secreting cells,
as provided by A. Rolink)*. After 10-14 days of IL-7-mediated positive selection,
cells were plated on 96-well plates in limiting dilutions to generate
single-cell-derived pre-B- or pro-B-cell clones. Clones were expanded and analysed by flow
cytometry. Fluorochrome-labelled monoclonal antibodies (FITC-conjugated
anti-CD25, APC-conjugated anti-c-Kit, PE-conjugated anti-B220 and APC-conjugated
anti-IgM; eBioscience) were used in accordance with the manufacturer’s
instructions. Cells were analysed with a FACSCalibur cytometer (Becton Dickinson),
using CellQuest software (Becton Dickinson).

ChIP analysis. Cells were cross-linked, chromatin extracted and
immunoprecipitated*” using antibodies directed against acetylated histone H3K9, H3K14 (5 µg
per 30 µg DNA), H3K4me3 (5 µg per 30 µg DNA) (Upstate Biotechnology) and
Pax5 (5 µg per 30 µg DNA) (Santa Cruz). ChIP using anti-Rag2 was performed as
previously described’. Enrichment of 3′ Jκ or Jκ5 regions was determined by
real-time PCR analysis (Applied Biosystems 7300), with JκAhdI L+R primers using
SYBR Green (Applied Biosystems) or with Jκ5 primers using TaqMan (QIAGEN).
Enrichments were calculated as follows: $\mathrm{enrichment} = 2^{-(CT_{\mathrm{Bound}} - CT_{\mathrm{Input}})} \times 10^{2}$.
The values were then normalized to negative control genes. A PCR primer list can 
be found in Supplementary Table 1. ChIP results recorded in this paper are all 
based on multiple (2-3) individual antibody precipitations. 
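
As an illustration of this calculation, the following minimal sketch computes an enrichment value from real-time PCR CT values. It assumes the usual convention that a lower CT means more template, so that enrichment is 2 raised to the power −(CT_bound − CT_input), multiplied by 100. Dividing by the mean of the negative-control values is likewise an assumption about how the normalization could be done, and the function names and CT values are hypothetical.

```python
from typing import Sequence


def chip_enrichment(ct_bound: float, ct_input: float) -> float:
    """Enrichment of the bound fraction relative to input, scaled by 100."""
    # A bound-fraction CT that is higher than the input CT means less template.
    return 2 ** (-(ct_bound - ct_input)) * 100


def normalised_enrichment(target: float, negative_controls: Sequence[float]) -> float:
    """Express a target enrichment relative to the mean of negative-control genes."""
    if not negative_controls:
        raise ValueError("at least one negative-control value is required")
    return target / (sum(negative_controls) / len(negative_controls))


# Hypothetical CT values, for illustration only.
target = chip_enrichment(ct_bound=27.0, ct_input=25.0)
controls = [chip_enrichment(29.0, 25.0), chip_enrichment(29.0, 25.0)]
print(target, normalised_enrichment(target, controls))
```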

A number of experimental approaches were used to determine allele-specific
enrichment. To distinguish between the hCκ and the mCκ alleles, the input and
bound fractions were analysed by semi-quantitative PCR (with 32P-labelled dCTP)
using hCκ- and mCκ-specific primers (hCκ L+R or mCκ L+R) (Supplementary
Table 1), as described*. Gels were then quantified using the EZQuant software and
the relative contribution of allelic enrichments was then determined. B6 and
M. castaneus alleles were distinguished by using known and verified (by sequencing)
SNPs. After PCR amplification with Jκseq L+R primers, products from the input
and bound fractions were extracted from gels using the Qiagen MinElute Gel
Extraction Kit and cloned using the Promega pGEM-T vector system. DNA from
20 white colonies of each ligation was amplified and subjected to sequencing with
T7 or Sp6 primers. Alternatively, we made use of allele-specific restriction enzyme
recognition sites to restrict the input and bound DNA fractions. After separation by
PAGE and quantification, allelic enrichment was calculated as follows: percentage
enrichment of digested allele = $100 \times \left(1 + \frac{\mathrm{Bound}_{\mathrm{uncut}}/\mathrm{Bound}_{\mathrm{total}}}{\mathrm{Input}_{\mathrm{uncut}}/\mathrm{Input}_{\mathrm{total}}}\right)$.
Primers for ChIP-allelic determination were CκBCCI L+R,
JκHhaI L+R and JκAhdI.alt L+L+R (Supplementary Table 1).

Replication timing. FISH analysis was performed as previously described’.
Briefly, cells were pulse labelled for 1 h with BrdU and isolated nuclei were then
fixed with methanol:acetic acid (3:1). After denaturation and dehydration, slides
were hybridized with labelled probes detecting both the Cκ and Rosa26 loci. BrdU
was detected by an anti-BrdU antibody (NeoMarkers) followed by
rhodamine-labelled anti-mouse antibody (Jackson Laboratories). ES cells harbouring the
IiMOG transgene in their Rosa26 locus’? were single-cell cloned in 96-well plates
by limiting dilution, before FISH analysis. Replication timing profiles (per cent
single/single, single/double and double/double) were determined by counting
30-100 BrdU-positive nuclei for each clone.
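
A replication timing profile of this kind is simply the percentage of nuclei in each FISH signal class. The sketch below shows the arithmetic; the function name and the example counts are hypothetical and are given for illustration only.

```python
def replication_profile(single_single, single_double, double_double):
    """Convert counts of FISH signal patterns into percentages of scored nuclei."""
    total = single_single + single_double + double_double
    if total == 0:
        raise ValueError("no nuclei were counted")
    counts = (
        ("single/single", single_single),   # neither allele replicated yet
        ("single/double", single_double),   # one allele replicated: asynchronous
        ("double/double", double_double),   # both alleles replicated
    )
    return {name: 100.0 * n / total for name, n in counts}


# Hypothetical counts from 100 BrdU-positive nuclei of one clone.
profile = replication_profile(30, 40, 30)
print(profile)
```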

HSCs, MPPs and CLPs isolated from LN3 (Vβ1NT/+) mice were FACS sorted
(using the following cell surface markers: HSCs, Lin−, Sca-1+, c-Kit+, FLK2−;
MPPs, Lin−, CD27+, CD19−, FLK2+, IL7Rα−, c-Kit+ and Sca-1+; CLPs, Lin−,
CD27+, CD19−, FLK2+, IL7Rα+, c-Kit+, Sca-1−) and single-cell cloned in
96-well plates by limiting dilution. Cells were grown on OPTIMEM media
supplemented with 10 ng ml−1 SCF, 10 ng ml−1 FLT3L and 10 ng ml−1 IL-7. In
addition, HSCs were also supplemented with 20 ng ml−1 mouse thrombopoietin
and 20 ng ml−1 mouse IGF-II. FISH analysis was performed as described above
using a rhodamine-labelled bacterial artificial chromosome (BAC) (RP24-297J5)
that specifically recognizes a large region deleted on the pre-rearranged Tcrb
allele***', and FITC-labelled BACs containing Cκ (RP24-387E13), the olfactory
receptor cluster 453, 454 and 38 (RP23-62H2) or Nanog (RP23-117I23), all located
on chromosome 6.

IL-7-dependent pre-B-cell clones were analysed for their replication timing by
S-phase fractionation, as previously described’. Briefly, cells were pulse labelled in
BrdU before collection, and nuclei were then FACS sorted according to DNA
content. BrdU DNA was isolated from each fraction and assayed using primers
specific for hCκ and mCκ (hCκ L+R or mCκ L+R). PCR products were
electrophoresed and quantified by autographic scanning both for total PCR product and
then for each allele individually to determine the proportion of the hCκ or mCκ
alleles in each lane.
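
The per-lane allele proportion described here can be sketched as follows. The fraction labels and band intensities are hypothetical values chosen for illustration, and the function name is introduced here rather than taken from the original analysis.

```python
def allele_proportions(hck_signal, mck_signal):
    """Return the share of the lane signal contributed by each allele."""
    total = hck_signal + mck_signal
    if total <= 0:
        raise ValueError("lane has no signal")
    return hck_signal / total, mck_signal / total


# Hypothetical band intensities (hCk, mCk) for successive S-phase fractions.
fractions = {"S1": (80.0, 20.0), "S2": (60.0, 40.0), "S3": (30.0, 70.0), "S4": (10.0, 90.0)}
for name, (hck, mck) in fractions.items():
    hck_share, mck_share = allele_proportions(hck, mck)
    print(name, round(hck_share, 2), round(mck_share, 2))
```

In a clone where the hCκ allele replicates early, its share would be expected to be highest in the earliest S-phase fractions and to fall in later ones, as in these illustrative numbers.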

Igh and Igκ rearrangement analyses. Heavy chain rearrangement was assessed
as previously described*. κ light chain rearrangement was assessed in IgM~
splenic and bone marrow B cells as well as in single pre-B-cell clones that were
induced to differentiate after IL-7 withdrawal for 48-72 h. mRNA was reverse
transcribed using MMLV-RT (Promega) and then PCR amplified using
exon-spanning primers (Supplementary Table 1). For the hCκ/mCκ IL-7-dependent
clones, a Vκ degenerate 5′ primer (VκDeg5′)** was used together with
hCκ/mCκ-specific primers (hCκ R or mCκ R). For the B6/Cast pre-B-cell clones the same Vκ
degenerate primer together with the CκBCCI R primer were used. The resulting
PCR product was gel cleaned and then subjected to a second round of PCR
using CκBCCI L+R primers. This PCR product was analysed either by
allele-specific restriction enzymes, or by sequencing, as described earlier. Quantitative
analysis of rearrangement (qPCR) was performed either at the DNA level using
specific Vκ4-74L and Vκ4-78L primers together with either Jκ1L or Jκ4L primers,
or at the RNA level by RT-PCR using Vκ4-74L and Vκ4-78L primers together
with a Cκ primer (CκBCCI R). In both cases results were normalized to control
genes.

In vivo reconstitution. 10’ cells from single-cell-derived clones or pools were
injected into the tail vein of Rag1−/− SCID mice. Three-to-four weeks after
injection B220+ cells were detected in the peripheral blood of injected mice, and both
bone marrow and spleens were then harvested. RT-PCR was performed on RNA
extracted from B220+ cells (sorted by FACS), using Vκ degenerate 5′ primers
(VκDeg1, VκDeg2 and VκDeg3)*° together with hCκ/mCκ-specific primers
(mCκ R1 or hCκ R1) or using the CκBCCI R primer in the case of B6/Cast cells,
and the allele specificity of the rearrangement products was then analysed as
described earlier.

Allelic choice plasticity assay. HSCs were purified from hCκ/mCκ mice and
from female animals containing a GFP expression marker on one of their X
chromosomes”, and were infected with lentiviral vectors expressing either GFP
or DS-Red, as described previously”*”®. Positively infected cells were injected into
SCID mice, and spleens were then harvested after 3-6 months. B cells were FACS
sorted either for the expression of hCκ or mCκ, or for GFP+ and GFP−
subpopulations, and genomic DNA was extracted. To analyse viral integration, Southern
blot analysis was performed on DNA digested with BamHI (New England Biolabs)
that cuts only once within the provirus, and probed with a fragment recognizing the
WPRE element present in the lentiviral vector.

FACS-sorted pro-B cells (CD19+, c-Kit+), CLPs (Lin−, CD27+, CD19−,
FLK2+, IL7Rα+, c-Kit+, Sca-1−), MPPs (Lin−, CD27+, CD19−, FLK2+,
IL7Rα−, c-Kit+, Sca-1+) and HSCs (Lin−, Sca-1+, c-Kit+, FLK2−) were purified
from bone marrow of hCκ/mCκ mice, were infected with lentiviral vectors
expressing GFP, and were treated as described earlier. Insertion site patterns were
detected using LAM-PCR, as described previously”. Briefly, genomic DNA
from splenic B cells expressing either hCκ or mCκ was linear amplified with
biotinylated primers (3′ LTR-IupB and 3′ LTR-IdownB) and then immobilized
on magnetic beads (Dynabeads M-280 Streptavidin (Dynal)). Double-strand
DNA was prepared with Klenow polymerase, followed by digestion with a
four-cutter restriction enzyme (Tsp509I) and by ligation with a linker cassette. After
denaturation, nested PCR with linker- and vector-specific primers was applied
(using LTR-IIB with LC-I primers for the first run, and LTR-III with LC-II primers
for the second PCR). Each sample was analysed at least two times. Gels were
quantified by averaging pixel intensity along the x-axis of each lane and
subtracting the lane-specific background (calculated as the average pixel intensity
for the whole lane). Intensity differences were taken into account by adjusting
the minimum and maximum peaks of each lane to be 0 and 1, respectively. A
line plot describing the relative intensity along the y-axis of each lane was then
generated.
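
The gel quantification steps described above can be sketched in a few lines. This is a minimal illustration under stated assumptions: a lane is represented as a list of pixel rows running down the gel, each row holding the intensities across the lane width, and the function name and example values are hypothetical.

```python
def lane_profile(lane):
    """Return the relative intensity profile (0 to 1) along the y-axis of one lane.

    `lane` is a list of rows; each row is a list of pixel intensities
    across the width (x-axis) of the lane.
    """
    # Average pixel intensity along the x-axis for every row of the lane.
    row_means = [sum(row) / len(row) for row in lane]
    # Lane-specific background: average pixel intensity of the whole lane.
    n_pixels = sum(len(row) for row in lane)
    background = sum(sum(row) for row in lane) / n_pixels
    corrected = [mean - background for mean in row_means]
    # Rescale so that the minimum is 0 and the maximum peak is 1.
    lowest, highest = min(corrected), max(corrected)
    if highest == lowest:
        raise ValueError("lane has no intensity variation")
    return [(value - lowest) / (highest - lowest) for value in corrected]


# Hypothetical 4-row, 2-pixel-wide lane with one strong and one weak band.
profile = lane_profile([[10, 10], [30, 50], [10, 10], [20, 20]])
print(profile)
```

Because the rescaling is applied after the subtraction, the constant background offset does not change the final profile; it is kept here only to mirror the order of the described steps.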


Crystal structure of the multidrug transporter
P-glycoprotein from Caenorhabditis elegans 


Mi Sun Jin, Michael L. Oldham, Qiuju Zhang & Jue Chen


P-glycoprotein (P-gp) is an ATP-binding cassette transporter that 
confers multidrug resistance in cancer cells’. It also affects the 
absorption, distribution and clearance of cancer-unrelated drugs 
and xenobiotics. For these reasons, the structure and function of 
P-gp have been studied extensively for decades’. Here we present 
biochemical characterization of P-gp from Caenorhabditis elegans 
and its crystal structure at a resolution of 3.4 ångströms. We
find that the apparent affinities of P-gp for anticancer drugs 
actinomycin D and paclitaxel are approximately 4,000 and 100 times 
higher, respectively, in the membrane bilayer than in detergent. This 
affinity enhancement highlights the importance of membrane par- 
titioning when a drug accesses the transporter in the membrane’. 
Furthermore, the transporter in the crystal structure opens its drug 
pathway at the level of the membrane’s inner leaflet. In the helices 
flanking the opening to the membrane, we observe extended loops 
that may mediate drug binding, function as hinges to gate the path- 
way or both. We also find that the interface between the transmem- 
brane and nucleotide-binding domains, which couples ATP 
hydrolysis to transport, contains a ball-and-socket joint and salt 
bridges similar to the ATP-binding cassette importers’, suggesting 
that ATP-binding cassette exporters and importers may use similar 
mechanisms to achieve alternating access for transport. Finally, a 
model of human P-gp derived from the structure of C. elegans P-gp 
not only is compatible with decades of biochemical analysis~”, but 
also helps to explain perplexing functional data regarding the 
Phe335Ala mutant'*"*. These results increase our understanding 
of the structure and function of this important molecule. 
P-glycoprotein uses the energy from ATP hydrolysis to pump 
substrates across the cell membrane. Drug transport depends on ATP 
hydrolysis'*"°, and the ATPase activity of P-gp is stimulated by the 
transported drugs'”'’. Caenorhabditis elegans P-gp is 46% identical to 
human P-gp. To ensure that this level of sequence identity translates 
into functional similarity, we tested whether overexpression of 
C. elegans P-gp confers cellular resistance to cytotoxic drugs known
to be transported by human P-gp. Two potent anticancer drugs,
actinomycin D and paclitaxel (Taxol), kill Spodoptera frugiperda (Sf9)
cells at concentrations greater than 0.1 µM (Fig. 1a). In comparison,
cells infected by recombinant baculoviruses carrying the C. elegans 
P-gp gene are resistant to these drugs in concentrations more than 
1,000-fold higher (Fig. 1a and Supplementary Fig. 1a). Using
detergent-purified protein, we measured whether substrates of human 
P-gp also stimulate the ATPase activity of C. elegans P-gp. Among the 
30 compounds we screened, actinomycin D, paclitaxel, progesterone, 
dipyridamole, and valinomycin increased the ATPase activity of 
C. elegans P-gp more than fivefold (Fig. 1b and Supplementary Fig. 1b). 
As expected for an ATP-binding cassette (ABC) transporter, both the 
basal and drug-stimulated levels of ATPase activity are inhibited by 
vanadate (Fig. 1b). The ATPase activity plotted as a function of drug 
concentration shows the characteristic biphasic response: with increas- 
ing drug concentration, ATPase activity increases to a maximum value 
and then decreases (Fig. 1c). This behaviour has been studied exten- 
sively with human P-gp*°’!. Like the P-glycoproteins identified in 
other species”, the substrate profile of C. elegans P-gp only partly 
overlaps with that of human P-gp. Nevertheless, the similarities 
between their amino-acid sequences and functional properties suggest 
that the structure of C. elegans P-gp would be a reasonable starting 
point for a mechanistic understanding of how human P-gp functions 
as a multidrug pump. 

P-glycoprotein is a single polypeptide with two homologous halves, 
each containing a transmembrane domain (TMD) and a cytoplasmic 
nucleotide-binding domain (NBD) (Fig. 2a). Crystals of C. elegans 
P-gp were obtained in the absence of nucleotides and transport 
substrates. The structure was determined at a resolution of 3.4 Å and
the amino-acid assignments were confirmed by 34 selenium-labelled 
methionines and three mercury-labelled cysteines (Supplementary 
Fig. 2 and Supplementary Table 1). 

Compared with the crystal structure of mouse P-gp determined at a 
resolution of 3.8 Å (ref. 23), that of C. elegans P-gp shows a similar

Figure 1 | Caenorhabditis elegans P-gp is a multidrug transporter.
a, Cytotoxicity assay. Spodoptera frugiperda Sf9 cells expressing P-gp were
cultured with various concentrations of actinomycin D (ActD, blue line) or
paclitaxel (red line). Uninfected cells were cultured in the presence of the same
drugs as controls (dashed lines). b, ATPase activity in the presence and absence
of 1 mM orthovanadate. c, ATPase activity as a function of substrate
concentration. The protein concentration was kept at 0.15 µM for all
measurements. Data points represent the means and standard deviations of
triplicate measurements from the same preparation.

Figure 2 | The molecular architecture of P-gp.
a, Secondary structure. b, Ribbon presentation.
c, The transmembrane cavity (green mesh) open to
the cytosol and continuous with the membrane
inner leaflet. The loops in TM10 and TM12 are
coloured red. d, Drug-stimulated ATPase activities

inward-facing conformation (that is, one open to the cytoplasm), with
a larger degree of separation between the two NBDs (Fig. 2b and 
Supplementary Fig. 3). Superposition of the transmembrane helices 
individually shows that sequence assignments of six comparable 
helices (TM1, TM2, TM6, TM7, TM8 and TM11) agree well. The 
two structures have different conformations in TM9, TM10 and 
TM12 and these helices thus cannot easily be compared. Helices 
TM3, TM4 and TM5, which can be compared, show important 
differences due to register shifts in model building (Supplementary 
Figs 4-6). As we will discuss later, these corrections in the structure 
are relevant to the identification of drug-interacting amino acids and 
an accurate definition of the NBD-TMD interface. 

In the crystal structure, the drug transport pathway is open to the 
cytoplasmic surface and is continuous with the membrane inner leaflet 
(Fig. 2c). Thus, in principle, drugs could gain access to the transport 
pathway from the aqueous phase as well as the membrane. It has been 
proposed that P-gp interacts with drugs from the inner leaflet of the 
membrane instead of the cytoplasm‘ (the ‘hydrophobic vacuum’ model). 
On the basis of this hypothesis, P-gp would be expected to be more 
sensitive to drug stimulation in membranes than in detergents, because 
most P-gp substrates are highly concentrated in the membrane, with 
partition coefficients ranging from 100 to 10,000 (ref. 24). In detergents, 
400 LM actinomycin D or 10 uM paclitaxel was required to obtain the 
maximum ATPase activity of purified P-gp (Fig. 1c). The addition of lipid 
molecules does not change the apparent drug affinities. Using mem- 
branes from yeast cells overexpressing C. elegans P-gp, we showed that 
both drugs stimulate the ATPase activities at a concentration of 0.1 1M 
(Fig. 2d), which is a 4,000-fold increase in sensitivity for actinomycin D 
anda 100-fold increase for paclitaxel. The degree of stimulation was lower 
when using the membrane sample than when using purified P-gp in 
detergent, probably owing to baseline activity of other cellular ATPases 
in the yeast cell membranes. We note that the dependence on drug 
concentration of ATPase stimulation in membranes matches well the 
concentration dependence observed in the cell protection assay (Figs la 
and 2d). The shift in drug sensitivity in membranes supports the 
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in the absence and presence of vanadate (1 mM) are 
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determinations from the same preparation. These 
results have been reproduced using different 
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hypothesis that drugs enter the transporter through the membrane’s 
inner leaflet. Such a mechanism, whereby transport depends on an 
elevated local drug concentration due to membrane partitioning, also 
implies that the intrinsic affinity of drugs for the transporter itself is 
low, consistent with the ability of P-gp to transport many different 
compounds. 
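
The sensitivity shift quoted above follows directly from the reported concentrations. The following is a minimal sketch in Python that uses only the values given in the text. The helper names and the ideal equilibrium-partitioning estimate of the local membrane concentration are illustrative assumptions, not part of the original analysis.

```python
# Drug concentrations (µM) needed for maximal ATPase stimulation in detergent,
# and the concentration at which both drugs stimulate P-gp in yeast membranes.
DETERGENT_UM = {"actinomycin D": 400.0, "paclitaxel": 10.0}
MEMBRANE_UM = 0.1


def fold_increase_in_sensitivity(detergent_um: float, membrane_um: float) -> float:
    """Ratio of the concentration required in detergent to that in membranes."""
    return detergent_um / membrane_um


def local_membrane_concentration(aqueous_um: float, partition_coefficient: float) -> float:
    """Assumed ideal partitioning: membrane concentration = K x aqueous concentration."""
    return aqueous_um * partition_coefficient


if __name__ == "__main__":
    for drug, conc in DETERGENT_UM.items():
        fold = fold_increase_in_sensitivity(conc, MEMBRANE_UM)
        print(f"{drug}: about {fold:,.0f}-fold more sensitive in membranes")
    # Partition coefficients of 100 to 10,000 (ref. 24) bracket the local estimate.
    for k in (100, 10_000):
        local = local_membrane_concentration(MEMBRANE_UM, k)
        print(f"K = {k}: about {local:g} µM in the membrane")
```

Under this simplifying assumption, 0.1 µM in the aqueous phase would correspond to roughly 10 to 1,000 µM within the membrane, a range that spans the concentrations required in detergent.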

The structure of C. elegans P-gp presents an interesting variation on 
the theme of membrane access, in that only one lateral opening is patent 
(Fig. 2c). An amino-terminal helical hairpin is inserted into the other 
lateral opening observed in the bacterial exporter MsbA (ref. 25) and 
mouse P-gp between TM4 and TM6 (Fig. 2b). A truncation mutant
devoid of the helical hairpin functions similarly to the full-length 
protein in a cytotoxicity assay (Supplementary Fig. 7a) but has a reduced
maximum level of stimulation in an ATPase activity assay (Supplementary
Fig. 7b). The dependence on drug concentration, however,
is unaltered for the truncation mutant (Supplementary Fig. 7b). 

Another notable feature of C. elegans P-gp is a discontinuity of 
helices TM10 and TM12 lining the lateral opening (Fig. 2c). We can 
think of two possible reasons why the helical secondary structure gives 
rise to more-disordered loop structures in this region. The first is that a 
breakdown of secondary structure creates a greater number of possible 
interactions between protein atoms and drugs entering the pathway. In 
other transporters where discontinuous transmembrane helices have 
been observed, the loops inside the membrane have been shown to bind 
substrates. Thus, loops flanking the lateral opening could assist drug
recognition and entry. The second possible reason is that these loop 
regions could function as flexible hinges to gate the pathway, mediate 
conformational changes associated with drug transport or both. 

The NBD-TMD interface is important in ABC transporters because
it transmits conformational changes associated with ATP hydrolysis to
substrate translocation. In all ABC importers for which structures
have been determined, the TMD is connected to the NBD through a
‘ball-and-socket’ joint. This joint consists of a cytoplasmic helix from
the TMD known as the coupling helix or EAA loop (the ball), which
docks into a cleft on the NBD surface (the socket) (Fig. 3a). A similar
structural feature is observed in P-gp, in which intracellular helices
IH2 and IH4 on the TMDs resemble the coupling helices and engage
the same surface of the NBDs (Fig. 3b, c). In addition to the
ball-and-socket joint, ABC exporters, including P-gp, have an additional set of
interactions in which IH1 and IH3 also interact with the NBDs,
creating a more extensive interaction surface between the TMDs
and the NBDs (Fig. 3b, c). The importance of this interface in P-gp
to both structure and function is underscored by the high degree of
amino-acid conservation, which approaches 80% identity between
C. elegans and human P-gp (Supplementary Figs 8 and 9).

Figure 3 | Interactions between the TMDs and
NBDs. a, NBD-TMD interface of the maltose
importer. MalF is the transmembrane subunit and
MalK is the NBD. Glu 401 is the conserved
glutamate residue in the EAA loop. b, c, NBD1-TMD (b)
and NBD2-TMD (c) interfaces in P-gp.
Dashed lines indicate the conserved salt bridges and
hydrogen bonds. WA, Walker A motif; WB,
Walker B motif.

Multiple crystal structures of the maltose importer show that the 
transition between different structures is accompanied by rotations of 
the EAA loop inside the cleft. Salt bridge interactions seem to be
central to tethering a pivot point in these conformational changes;
elimination of the salt bridges by double mutations results in a
defective transporter and dissociation of the MalK subunit. For the
NBD1-TMD interface in P-gp, three highly conserved residues, 
Arg 946 (IH4), Asp 188 (IH1) and Tyr 468 (NBD1), are engaged in a 
network of salt bridge interactions (Fig. 3b). The same atomic 
interactions are also observed in the NBD2-TMD interface, involving 
residues Arg 286 (IH2), Asp 846 (IH3) and Tyr 1129 (NBD2) (Fig. 3c). 
It is likely that these interactions in P-gp have a role similar to the salt 
bridge interactions in importers such as the maltose transporter. 

To assist structural and functional analysis of human P-gp, we
generated a homology model of human P-gp based on the structure
of the C. elegans P-gp determined in this study. The modelled structure
is consistent with a large body of biochemical and biophysical data on
human P-gp (Fig. 4 and Supplementary Fig. 10). For example, pairs
of residues in the TMDs shown by cysteine mutagenesis to form
disulphide bonds are located in close proximity to each other
(Fig. 4b, c). The NBD-TMD interfaces are consistent with data
showing that Ala 266 in TMD1 is in close proximity to Phe 1086 in
NBD2 and that cysteines introduced at positions 443 and 474 in
NBD1 crosslink with residues 909 and 905 in TMD2, respectively
(Fig. 4d, e). Residues protected by drug substrates from inhibition by
thiol-reactive analogues are suggested to form the drug-binding
region. In the modelled structure, 17 of the 19 protected residues are
distributed on the surface of the drug-translocation pathway (Fig. 4f).
Most of these residues are non-polar, consistent with the hydrophobic
nature of the substrates. In contrast, the 17 residues that were not
protected by substrate, and therefore are presumed not to be involved
in drug binding, are scattered throughout the TMDs (Fig. 4f).
Previously, the orientations of TM3, TM4 and TM5 with respect to
the drug-translocation pathway predicted by arginine-scanning
mutagenesis were incompatible with the crystal structure of mouse
P-gp. On correction of amino-acid registry (Supplementary Figs 5
and 6), these biochemical data are now consistent with the homology-
modelled human P-gp (Supplementary Fig. 10).

The atomic structure of P-gp offers insights into functional data that
otherwise are difficult to explain. For example, mutants that enhance
drug-stimulated ATPase activity often show stronger resistance to the
same drug, probably owing to an increased affinity for the drug.
However, this correlation does not hold for the Phe335Ala mutant,
which has a higher ATPase activity but confers lower or similar drug
resistance. The structure shows that Phe 335 is located at the apex
of the drug transport pathway, making van der Waals interactions with
Tyr 310 and Phe 314 in TM5, and Ile 218 in TM4 (Fig. 4a, g). These
contacts will be broken in the outward-facing conformation, in which
the transport pathway has to be continuous with the extracellular side
of the membrane. Mutating the phenylalanine to alanine, if doing so
destabilized the inward-facing conformation, would facilitate the
transition to the outward-facing conformation and thus give rise to
higher basal and drug-stimulated ATPase activities. In this description,
the Phe335Ala mutant would be reminiscent of mutants identified in
the maltose transporter that allow constitutive ATP hydrolysis in the
absence of the binding protein.

Figure 4 | A model of human P-gp. a, Overall structure. b, c, Pairs of residues
in TMDs that formed disulphide bonds (green line) when mutated to
cysteines. d, e, Pairs of residues at the NBD-TMD interfaces that were
crosslinked. f, Stereo view of the drug transport pathway. Drug-interacting
residues are labelled and shown as magenta balls. The non-protected
residues, Tyr 118, Val 125, Val 133, Cys 137, Gln 195, Asn 296, Gly 300,
Tyr 310, Phe 314, Ala 729, Phe 759, Ser 766, Gly 774, Asn 842, Ala 871, Ser 943
and Phe 957, are shown in grey. g, Phe 335 in TM6 (red) interacts with
residues in TM4 and TM5 (yellow sticks), thereby stabilizing the
inward-facing conformation.

The structure of C. elegans P-gp provides an accurate model with 
which to interpret decades of functional and biochemical data on P-gp. 
The functional data complement the crystal structure to support a 
picture in which P-gp uses the energy from ATP hydrolysis to expel 
lipophilic molecules from the inner leaflet of the membrane. Many 
details of this molecule, such as how substrate binding activates the 
ATPase activity, how ATP hydrolysis is coupled to substrate flipping 
and how multiple substrates are recognized by a single transporter, 
remain to be elucidated. 


METHODS SUMMARY 


The gene encoding C. elegans P-gp was synthesized and optimized for expression
in yeast and insect cells (Bio Basic Inc.). Native protein was expressed in Pichia
pastoris strain SMD1163 and selenomethionine-substituted protein was
expressed in Hi5 cells. The cytotoxicity assays were performed using
baculovirus-infected S. frugiperda Sf9 cells and the ATP/NADH coupled assays were
carried out at 25 °C. Crystals were obtained at 4 °C by the vapour diffusion
method. Diffraction data were collected at the Advanced Photon Source at
Argonne National Laboratory. The initial phase was obtained by molecular
replacement (PHASER, CCP4) using separate domains of Sav1866 and mouse
P-gp structures as search models. The molecular replacement phase was
combined with single-wavelength anomalous dispersion phasing (PHASER-EP,
CCP4) to identify selenomethionine-substituted residues and mercury-labelled
cysteines. The model was built in COOT and refined using CNS and REFMAC5.
The human P-gp model was generated using MODELLER
(http://salilab.org/modeller/about_modeller.html).


Full Methods and any associated references are available in the online version of 
the paper. 


METHODS 


Cloning, expression and purification. A synthetic gene encoding the full-length 
C. elegans P-gp (pgp-1; GenBank accession code, ABO1232.1) was subcloned into 
the pPICZ (Invitrogen) and pVL1393 (BD Biosciences) vectors. The carboxy 
terminus of P-gp was fused to enhanced green fluorescent protein (eGFP) plus
a deca-histidine tag or the Protein A tag derived from pEZZ18 (GE Healthcare). 
A PreScission protease cleavage site was engineered between P-gp and the 
C-terminal tag. 

Native protein was expressed in P. pastoris strain SMD1163 (Invitrogen). Cells
were grown at 28 °C in minimal methanol medium and induced with 0.5%
methanol for 24 h. After collection by centrifugation (4,000g for 15 min), the cells
were frozen in liquid nitrogen and broken by cryomilling (Retsch model MM400).
Broken cells were resuspended in lysis buffer containing 50 mM Tris-HCl, pH 8.0,
150 mM NaCl, 20% glycerol, 5 mM MgCl₂, 1 mM PMSF, 1 mM benzamidine,
0.1 mg ml⁻¹ trypsin inhibitor, 3 µg ml⁻¹ DNase, 1 µg ml⁻¹ pepstatin A, leupeptin
and aprotinin. Membranes were solubilized by adding 1% (w/v)
n-dodecyl-β-D-maltopyranoside (DDM, Affymetrix) to the lysis buffer and incubating at 4 °C for
2 h. Insoluble membrane was removed by centrifugation at 80,000g for 40 min and
supernatant was loaded onto cobalt affinity resin (Clontech). After on-column
cleavage by PreScission protease, protein was eluted and reloaded onto a GST
column to remove the GST-tagged PreScission protease. Protein was concentrated
using an Amicon Ultra (MWCO 50K, Millipore) centrifugal device and further
purified by Superdex 200 size-exclusion chromatography (GE Healthcare) in a
buffer containing 20 mM MES, pH 6.5, 200 mM NaCl, 10% glycerol, 5 mM DTT
and 0.02% (w/v) n-undecyl-β-D-thiomaltopyranoside (UDTM, Affymetrix).

Selenomethionine-labelled protein was expressed in Hi5 cells (Invitrogen)
adapted to methionine-free medium (Expression Systems). Baculovirus-infected
Hi5 cells were cultured for 16-20 h at 28 °C, supplemented with 200 mg l⁻¹
L-(+)-selenomethionine (Fisher Scientific), and then incubated for an additional 48 h. Cells
were collected and broken using a high-pressure homogenizer (Emulsiflex-C5,
Avestin) in the lysis buffer plus 5 mM DTT. Cell debris was removed by low-speed
centrifugation (6,000g, 20 min) and membranes were collected by high-speed
centrifugation (200,000g, 1 h). The membranes were homogenized and solubilized
with 1% (w/v) DDM for 2 h. Extracted protein was purified using IgG sepharose
affinity resin (GE Healthcare) followed by PreScission cleavage to remove the
Protein A tag. The protein sample was further purified using GST affinity resin
and gel-filtration chromatography. All purification steps were carried out at 4 °C.
Cytotoxicity assay. Sf9 cells infected with recombinant baculovirus carrying the
P-gp + eGFP gene were incubated at 28 °C for 24 h before actinomycin D or
paclitaxel (Sigma-Aldrich) was added to the medium. Cells were monitored by
counting the cell densities every 6 h for 1 d in the presence of actinomycin D or
every 24 h for 3 d in the presence of paclitaxel. The percentage cell viability was
calculated by dividing the number of live cells cultured with drug by that of the
infected cells cultured without drug. In the presence of actinomycin D, cytotoxic
effects were observed after 6 h of drug exposure and reached a maximum at 12 h.
Paclitaxel-induced cell death was observed after 24 h of drug exposure and reached
a maximum at 48 h. P-glycoprotein expression was monitored by fluorescence
microscopy. Each assay was repeated in triplicate.
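
The viability calculation described above can be sketched as follows. This is a minimal illustration in Python, not the authors' analysis script. The function names and the cell counts in the usage example are hypothetical, and reporting the mean and sample standard deviation of the triplicate is an assumption modelled on the figure legends.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple


def percent_viability(live_with_drug: float, live_without_drug: float) -> float:
    """Live cells cultured with drug as a percentage of infected cells cultured without drug."""
    if live_without_drug <= 0:
        raise ValueError("the drug-free control count must be positive")
    return 100.0 * live_with_drug / live_without_drug


def summarize_triplicate(values: Sequence[float]) -> Tuple[float, float]:
    """Mean and sample standard deviation of repeated viability measurements."""
    if len(values) < 2:
        raise ValueError("at least two measurements are needed for a standard deviation")
    return mean(values), stdev(values)


if __name__ == "__main__":
    # Hypothetical (with drug, without drug) live-cell counts for three repeats.
    repeats = [(0.8e6, 1.0e6), (0.9e6, 1.0e6), (1.0e6, 1.0e6)]
    viabilities = [percent_viability(drug, control) for drug, control in repeats]
    avg, sd = summarize_triplicate(viabilities)
    print(f"viability = {avg:.1f} +/- {sd:.1f} %")
```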
Preparation of microsomes. Cryomilled yeast cells were resuspended in the lysis 
buffer and the suspension was centrifuged at 4,000g for 15 min to remove the cell 
debris. The supernatant was ultracentrifuged at 200,000g for 1h and the pellet 
containing the microsomes was stored at −80 °C.


ATP hydrolysis assay. The ATPase activity of C. elegans P-gp was analysed using
a coupled assay in which the regeneration of hydrolyzed ATP was coupled to the
oxidation of NADH. Purified protein was added to the ATPase reaction buffer
(50 mM potassium HEPES, pH 8.0, 10 mM MgCl₂, 60 µg ml⁻¹ pyruvate kinase,
32 µg ml⁻¹ lactate dehydrogenase, 4 mM phosphoenolpyruvate, 0.3 mM NADH
and 5 mM ATP) at 25 °C in the presence or absence of drug substrates. For
vanadate inhibition, the protein samples were mixed with 1 mM vanadate in the
presence of ATP-Mg for 10 min at room temperature (22 °C) before measuring
the ATPase activity. To determine the drug-stimulated ATPase activity of P-gp in
the membrane, microsomes (5 µg) were pre-incubated in the reaction buffer with
actinomycin D or paclitaxel for 5 min at room temperature. The fold increase in
drug stimulation was calculated by subtracting the rate of ATP hydrolysis in the
presence of vanadate (1 mM) from that in the absence of vanadate.
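
The arithmetic behind the coupled assay and the vanadate correction can be sketched as below. This is an illustrative Python example under stated assumptions, not the authors' procedure. It assumes that NADH oxidation is followed by absorbance at 340 nm with the standard extinction coefficient of 6,220 M⁻¹ cm⁻¹ (the wavelength is not given in the text), that one NADH is oxidized per ATP hydrolysed, and that fold stimulation is the ratio of vanadate-sensitive rates with and without drug. All function names are hypothetical.

```python
NADH_EXTINCTION_M_CM = 6220.0  # M^-1 cm^-1 at 340 nm; assumed readout wavelength


def atp_hydrolysis_rate_um_per_min(delta_a340_per_min: float,
                                   path_length_cm: float = 1.0) -> float:
    """Convert the NADH absorbance change per minute (negative as NADH is consumed)
    into µM ATP hydrolysed per minute, assuming 1:1 NADH:ATP stoichiometry."""
    molar_per_min = -delta_a340_per_min / (NADH_EXTINCTION_M_CM * path_length_cm)
    return molar_per_min * 1e6


def vanadate_sensitive_rate(rate_without_vanadate: float,
                            rate_with_vanadate: float) -> float:
    """Subtract the rate measured with 1 mM vanadate from the rate measured without it."""
    return rate_without_vanadate - rate_with_vanadate


def fold_stimulation(drug_rate: float, drug_rate_vanadate: float,
                     basal_rate: float, basal_rate_vanadate: float) -> float:
    """Ratio of vanadate-sensitive rates with and without drug (assumed definition)."""
    basal = vanadate_sensitive_rate(basal_rate, basal_rate_vanadate)
    if basal <= 0:
        raise ValueError("basal vanadate-sensitive rate must be positive")
    return vanadate_sensitive_rate(drug_rate, drug_rate_vanadate) / basal


if __name__ == "__main__":
    # Hypothetical rates in arbitrary but consistent units.
    print(fold_stimulation(drug_rate=5.0, drug_rate_vanadate=1.0,
                           basal_rate=3.0, basal_rate_vanadate=1.0))
```

Subtracting the vanadate-insensitive rate before taking the ratio matters most for microsomes, where other cellular ATPases contribute background activity.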
Crystallization and data collection. Native crystals were obtained by mixing 1 µl
of protein sample (10 mg ml⁻¹) and 1 µl of reservoir solution containing 100 mM
HEPES, pH 6.6-7.2, 200 mM sodium malonate and 19-22% PEG2000MME at
4 °C using the sitting-drop vapour diffusion method. Mercury-labelled crystals
were prepared by adding 1 mM methyl mercury(II) chloride into crystal-containing
drops for 4 h at 4 °C. Crystals of selenomethionine-substituted protein were
obtained with 100 mM HEPES, pH 6.6-7.2, 200 mM ammonium phosphate
monobasic and 19-22% PEG1500 at 4 °C. All crystals appeared within 2 d
and continued to grow to full size in one week. Crystals were dehydrated and
cryoprotected by a serial increase in PEG concentration in 5% steps in the reservoir
(final 30%) for 48 h followed by 2.5% steps in the drop (final 35%) for 2 h. Crystals
were flash-frozen in liquid nitrogen. Diffraction data were collected at the 23-ID
beamline at the Advanced Photon Source. Diffraction images were indexed,
integrated and scaled with the HKL2000 package (HKL Research Inc.).
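
The stepwise dehydration described above can be written out as a concentration schedule. The sketch below is a hypothetical Python helper. The starting concentration of 20% is one example from the stated 19-22% range, clamping the last step to the final value is an assumption, and the time spent at each step is not given in the text and is not modelled.

```python
from typing import List


def step_schedule(start_percent: float, final_percent: float,
                  step_percent: float) -> List[float]:
    """PEG concentrations reached after each step, with the last step clamped to the final value."""
    if step_percent <= 0:
        raise ValueError("step size must be positive")
    if final_percent < start_percent:
        raise ValueError("final concentration must not be below the start")
    levels = []
    current = start_percent
    while current < final_percent:
        current = min(current + step_percent, final_percent)
        levels.append(current)
    return levels


if __name__ == "__main__":
    reservoir = step_schedule(20.0, 30.0, 5.0)   # 5% steps in the reservoir, final 30%
    drop = step_schedule(30.0, 35.0, 2.5)        # 2.5% steps in the drop, final 35%
    print("reservoir PEG (%):", reservoir)
    print("drop PEG (%):", drop)
```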

Structure determination and homology modelling. The structure was
determined by molecular replacement (PHASER, CCP4) using separate domains
of bacterial Sav1866 and mouse P-gp structures as search models. The molecular
replacement phase was combined with single-wavelength anomalous dispersion
phasing (PHASER-EP, CCP4) to identify methionine and cysteine sites. The model
was built by iterative modelling in COOT and refined using CNS and
REFMAC5. The crystallographic statistics are summarized in Supplementary
Table 1. Residues of the N terminus (1-3, 52-54), the linker region (666-715)
and the C terminus (1307-1321) were not visible in the electron density map and
were not included in the final structure. No residues lie in the disallowed region of
the Ramachandran plot. The homology model of human P-gp was generated by
the program MODELLER on the basis of the structure of C. elegans P-gp in which
the N-terminal region and the linker region were excluded because of low
sequence conservation (Supplementary Fig. 9). All figures were generated using
the program PyMOL (http://www.pymol.org/).

Corrigendum: Neutralizing antibodies derived from the B cells
of 1918 influenza pandemic survivors

Nature 455, 532-536 (2008); doi:10.1038/nature07231


In this Letter, the heavy-chain sequence of 1F1 was in error and has 
been corrected in GenBank (EU169674.2). Also, the 1947 virus tested 
that was not inhibited by 1F1 should have been designated A/USA/ 
L3/1947(H1N1) rather than A/Fort Monmouth/1/1947(H1N1). In 
addition, 1F1 exhibits HAI activity of 1.3 µg ml⁻¹ (not undetectable)
against A/Fort Monmouth/1/1947 virus. These errors do not affect 
our main conclusions. These errors have been corrected in the HTML 
and PDF of the original paper (particularly Table 1 and Fig. 1) and in 
the original Supplementary Information. 

Corrigendum: RNA sequencing of pancreatic circulating tumour cells
implicates WNT signalling in metastasis

Min Yu, David T. Ting, Shannon L. Stott, Ben S. Wittner,
Fatih Ozsolak, Suchismita Paul, Jordan C. Ciciliano,
Malgorzata E. Smas, Daniel Winokur, Anna J. Gilman,
Matthew J. Ulman, Kristina Xega, Gianmarco Contino,
Brinda Alagesan, Brian W. Brannigan, Patrice M. Milos,
David P. Ryan, Lecia V. Sequist, Nabeel Bardeesy,
Sridhar Ramaswamy, Mehmet Toner, Shyamala Maheswaran
& Daniel A. Haber

Nature 487, 510-513 (2012); doi:10.1038/nature11217


In this Letter, we omitted the following accession information. The 
digital gene expression matrix and the Helicos single-molecule sequence
data from which it was derived are available at the NCBI
GEO database with accession number GSE40176. A total of 44 samples 
were uploaded including 12 mouse and 32 human samples. Paired RNA 
sequencing data for epithelial cell adhesion molecule (EPCAM) and 
control immunoglobulin circulating tumour cell (CTC)-Chips from 
two pancreatic cancer mice, four human healthy donors, and 12 human 
metastatic pancreatic cancer patients are included in this data set. 

CORRECTIONS & AMENDMENTS

CORRIGENDUM
doi:10.1038/nature11645

Corrigendum: High-performance bulk thermoelectrics with all-scale
hierarchical architectures

Kanishka Biswas, Jiaqing He, Ivan D. Blum, Chun-I Wu,
Timothy P. Hogan, David N. Seidman, Vinayak P. Dravid
& Mercouri G. Kanatzidis


Nature 489, 414-418 (2012); doi:10.1038/nature11439 


In this Letter, the units of thermal conductivity on the y-axes of Fig. 2d
and e should be W m⁻¹ K⁻¹ and the y-axis label of Fig. 3d should be
‘Count (%)’. Figures 2 and 3 of the original paper have been corrected
online.